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Preface 


The content of this book was developed over more than 15 years of teaching the course 
“Introduction to Systems Biology”, first at the University of Stuttgart and then mainly at 
the University of Luxembourg. This course aims to introduce key mathematical concepts 
of systems biology to students with mainly biology backgrounds. Easily accessible toy 
examples are used to illustrate these concepts in a straightforward way. Some of these 
examples, as well as some of the ideas in the book, come from colleagues, whom we 
would like to thank very much for sharing their work. 

Over the years, the course style changed from traditional classroom 
teaching—with lectures on the concepts and demonstrations of exercise solutions—to 
more self-paced and interactive learning using the flipped-classroom method (see the 
Introduction of this book). This usually consisted of a short kick-off lecture emphasizing 
the key concepts briefly and answering some general questions of the class. The 
remainder of the day was then organized into flexible group work in class with the 
support of tutors, and independent study time (usually in the afternoons). This allowed 
the students to progress at their own pace and to support each other. Final exam results 
improved by around 2 points on a scale of 20 as a result of this new method. 

The course was complemented with talks about current research questions and 
examples of the lab or the field in general. These talks were either given by me (Thomas 
Sauter) or by the assisting postdoctoral and PhD students. Within the curriculum of the 
Master’s in Integrated Systems Biology at the University of Luxembourg, this course was 
followed by 2 practical computational courses, where the students applied the introduced 
mathematical concepts to self-designed and self-executed projects. These project-based 
learning courses focused on metabolic network modelling using constraint-based 
modelling (see Chapter 2 of this book) and on pharmacokinetic (PK) modelling using 
ordinary differential equations (see Chapter 3 of this book). The structure of these 
courses, along with some illustrative example projects, is detailed in the article “Project- 
Based Learning Course on Metabolic Network Modelling in Computational Systems 
Biology” (Sauter et al., 2022)!. The combination of studying the theory at one’s own pace 


and applying it to self-designed projects has proven to be an effective way of learning. 


' PLoS Comput Biol 2022 Jan 27; 18(1):e1009711, https://doi.org/10.1371/journal.pcbi.1009711. 
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Suggestions and corrections are very welcome (by email to: thomas.sauter@uni.lu) 


and will be considered for the next edition of this book. 


On a personal note, I would like to take this opportunity to express my 
thankfulness to my parents—your love and hard work have laid the foundation for my 
career—and to my family: Sabine, it is so precious to have you by my side. Josephine, it 
is great to see you growing up and shining. And Leonard, I am grateful for our days 


together. You were the first to see this book. 


Thomas Sauter, Nittel & Belval, October 2022 
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Introduction 


Thomas Sauter, Marco Albrecht 


Motivation 
In this book, you will learn how mathematical models of biological networks are built and how the analysis of 
such models help to understand the system-level properties of networks. The book will introduce you to the 
language of systems biology which needs to be spoken among biologists, physicists, computer scientists, and 
engineers in the interdisciplinary research environment of bio-medicine. Science is about what is; Engineering 
is about what can be. Combining both will enrich your profile as an academic and enrich your view of the 
world around us. We are on the brink of the era of network medicine. This novel approach has the potential to 


revolutionize and personalize the treatment of patients. This book focuses on some of the fundamental concepts 
which are essential to developing successful network medicine approaches in the upcoming years. We hope you 
enjoy reading this book as much as we enjoyed writing it. 
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network-based drug discovery, with applications mainly 
in cancer biology. He has more than 20 years of experi- 
ence in educating and supporting students. 


Marco Albrecht is an engineer, 
trained in system theory, control 
engineering, modeling, and molec- 
ular biology. He studied bio- 
systems engineering at the Otto- 
von-Guericke University in Magde- 
burg and did a PhD at the Uni- 
versity of Luxembourg on “Math- 
ematical histopathology and sys- 
tems pharmacology of melanoma” in the context of the 
MELPLEX ITN training program supported by the Eu- 
ropean HORIZON 2020. He is now a research scientist 
at esqLABS GmbH, Germany, with expertise in Quanti- 
tative Systems Pharmacology. 


2. Overview 


Complex systems can be found in many fields, and re- 
searchers in biology take ever more advantage of this 
and related concepts shown in Figure 1. The concepts are 
now reaching the realm of medicine and also raise sev- 
eral challenges for data integration. We suggest reading 
the paper on systems medicine [1] which is summarized 
in Figure 2. Some of these concepts will be explained in 
this book "Introduction to Systems Biology". 

Many computational courses rely on linear algebra 
and other mathematical concepts. Consequently, it will 
be very important to pay sufficient attention to these 
mathematical basics. We incorporated a good share of 
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Figure 1. Complex systems organizational map. Created by H. Sayama, Collective Dynamics of Complex Systems 
Research Group at Binghamton University, New York. Wikimedia. Licence: CC BY 4.0. 


it in this book, as you will see in the following chapters. 
But first we would like to make some remarks about 
the design of the book and the respective course on "In- 
troduction to Systems Biology". We therefore review 
the research evidence for effective learning and reveal 
pitfalls which might emerge within an interdisciplinary 
study program. We also give you lists with small and 
prioritized learning units, which you can tick off step by 
step. This book contains several links to YouTube videos. 
Check them out by clicking on the link: 

YouTube: Systems Thinking 


3. Planning 


We divided this course into four parts with increasing 
levels of modeling detail, shown in Figure 3. 

The detailed content is specified in the learning check- 
list on page 8. Here, we give here a rough overview of 
what we want to achieve. 


Course aims (what): 


e Gain confidence in the step-wise calculation of 
mathematical problems. 


e Connect mathematical concepts to biological real- 
world problems. 


e Enable efficient communication between biology 
and computational disciplines. 


Course goals (how): 


e We demonstrate the step-wise calculation in this 
book and with the help of YouTube videos. 


e We connect theoretical approaches with real-world 
biology. 


e We explain the geometrical intuition behind math- 
ematical operations. 
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Figure 2. Systems medicine: Brown: conventional approach. Green: data flow. Blue: information flow. Source: [1]. 
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This book has been developed for a full-time two-week 
course following the flipped-classroom approach which 
we will introduce in the next section. 


4, Learning 


The following insights come from educational studies 
[2]. Social scientists compare between-group differences 
and within-group differences with the measure Cohens 
d. A Cohens d = 0.5 means that the difference between 
groups is half the difference within groups. Social scien- 
tists interpret d as follows: 


d~0.2 : small effect 


d~0.5 : medium effect 


Part 1 


steady state 


Part 2 


steady state 


network models metabolic networks 


linear algebra stoichiometric matrix 


Part 3 
dynamic change of the system 


elements 


systems science 
control, hysteresis 


Figure 3. Advance organizer of this book. 


d 0.8 : strong effect 


with the hint that even minor effects can become relevant 
in combination with others. 


Self-efficacy belief and regulation of effort 


The most effective attitude is the self-efficacy belief (d = 
1.81). Accordingly, we have organized the course in a 
way that ensures you have the most flexibility in tackling 
the problems on your own, and we will try to support 
you. We have tried to give clear objectives, prioritize 
the different tasks and optimize the course structure 
to help you progress fast without losing time. These 
precautions also complement your ability to regulate the 
effort by yourself (d = 0.75). Your positive energy and 
willingness to master this ambitious course will make 
the biggest impact beyond anything we can do. Passing 
this course gives you a great feeling of accomplishment 
and a new view of biology. With the right practice and 
the belief that you can make the most difference, you 
have the key to success in your own hands. This effect 
size is very strong and compensates for differences in 
talent, intelligence, and unchangeable traits to a large 
extent. Personality, intelligence, gender, time of year and 
working hours (for an office-based job) are altogether 
minor effects (—0.24 < d < 0.32). Intelligence explains 
4% of the exam results. Joy, pride, and hope (—0.24 < 
d < 0.32) are more productive attitudes than anger, fear, 
and charm (—0.8 < d < —0.28). 
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Effective practice in an interdisciplinary environment 


The success of teaching various learning strategies de- 
clines from elementary school (d = 0.92) to university 
(d = 0.28), which can be explained by the supposition 
that students learn which strategy is best for achieving 
results over time. However, learning strategies are highly 
subject dependent and can hardly ever be transferred to 
other disciplines. Studying concepts in biology requires 
the memorization of many facts to achieve a sufficient 
knowledge base. A huge amount of initially unrelated 
facts have to be learnt in order to interpret new obser- 
vations, design experiments, and understand relation- 
ships. Mathematics and engineering, however, require 
the memorization of a few and simple basic concepts 
with which they construct their theories. Only axioms 
and basic equations must be learned. The challenge is to 
apply those concepts to different cases and tasks. Some 
tasks seem simple but can be unsolvable problems, while 
other, seemingly more complicated equation sets, can 
turn out to be easy. Getting a feeling for the underlying 
approaches in each discipline takes time. Biologists usu- 
ally have to make countless observations and deconstruct 
things in order to understand their origin. In contrast, 
engineers combine different elements to build something 
up and to achieve a certain behavior. Engineers combine 
problem-dependent modules of equations together to 
represent desired or natural systems and their behavior. 
In contrast, physicists always search for a simple underly- 
ing equation to help them understand nature itself. Com- 
puter scientists, bio-informaticians etc structure, handle, 
and store data by automating procedures according to 
the wishes of a user without the inner motivation to 
understand nature itself. A new problem can confront 
computational scientists with the time-consuming need 
to develop new software. Once this step is solved, the 
computational running time for solving the actual prob- 
lem might be low. Therefore, they always search for 
pre-developed software modules and libraries. The gen- 
eration of data in biology is much more incremental and 
steady, partly because the problems and tasks are often 
unique. Thus, the general thinking and research prac- 
tices of different scientists can contrast. Synthetic biology 
is a sub-discipline of biology which resembles the think- 
ing in engineering the most. To engineers, it might be 
helpful to say that they have to solve a so-called inverse 
problem, which is the most frequent problem biologists 
face. This is a very sharp separation of different think- 
ing schools, and you will see that scientists can have a 
mixture of those approaches but it might help to recog- 
nize problems of misunderstanding. Neither of these 
is wrong, nor better than the other. The problems they 
tackle have simply moulded their way of thinking to the 
optimal mode for the discipline, which would probably 
fail if applied to another area. 
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Do not underestimate the amount of effort required 
to learn mathematics. Concepts make up around 20-30% 
of your learning time and 70-80% of your time will be 
necessarily devoted to solving equations and tasks on 
your own. This can be best compared with your lab 
work. The more you can automatize isolated tasks like 
media preparation and pipetting, the more capacity is 
free to solve more comprehensive and complex working 
schedules in the lab. Time set aside for practice is im- 
portant (see Figure 4). In the beginning, you will work 
through several subparts of a task, but one individual 
subtask might still limiting your overall performance. 
This can be frustrating—for example, if one learns a new 
programming language. At first, it seems unfathomable, 
but you can make more progress than you think. After 
you have reached a certain level, you will progress very 
fast. At the upper level, you will become so proficient 
that the improvements seem to slow down as they are 
not recognizable anymore. At this stage, expert feedback 
is necessary to help you recognize flaws and find new 
challenges to work on. 
Additionally, having willingness to solve the given prob- 
lem with different approaches, whatever it takes, is a 
good trait to become a good computational scientist. We 
provide you with the solutions directly to give you more 
responsibility, but do not look at the solution immediately— 
only if you get stuck for a long time. You have to improve 
your skills, not just your knowledge. One also has to 
frequently change between studying concepts and prac- 
ticing in order to progress. Some formulations might be 
circuitous at first glimpse, but become more understand- 
able after solving tasks. But don’t worry, the purpose of 
this course is an introduction to computational problem- 
solving and many difficulties remain even in physics, 
mathematics, and engineering schools, where years are 
dedicated to solving such tasks. Much of what you learn 


late 


mid 


performance 


early 


time practicing 

Figure 4. Performance gain in complex tasks can seem 
nonlinear. When too many uncertainties in subtasks 
hamper overall performance, sufficient practice time ac- 
celerates performance until it flattens down again. Do 
not give up too quickly. You never know when you will 
skyrocket. Source: [3]. Copyright © 2010, John Wiley 
and Sons. 


in the course is comparable with learning a sequence of 
activities not far from following a cooking recipe. You 
will manage it! Because we integrate active learning 
sessions, we will likely reduce the failure rate. Tradi- 
tional lecturing would increase the failure rate by 55% 
in science, engineering, and mathematics [4]. 


How to study engineering, math, and physics 

We have some tips for studying courses with many equa- 
tions. Our book will be somewhat between a classical 
biology and typical engineering text. 

About understanding and learning 

It is quite favorable to tackle the material before the lec- 
ture. Concepts in mathematics, engineering, and physics 
are more or less always the same and do not change as 
fast as some concepts in biology. They are also not as 
comprehensive as in biology. But they are not so eas- 
ily accessible, because mathematical terms are made up 
of highly compressed knowledge. Lecturers in biology 
more often use PowerPoint presentations to transmit 
the knowledge, while lecturers in engineering use the 
blackboard to slow the knowledge transfer down. It is 
essential to see how things evolve. It will also be on an 
entirely different level than what you are used to from 
high school. In an engineering class, you have to plan 
more time for digesting and understanding the material 
before moving on to new topics. Most engineering stu- 
dents prefer to see the concepts first in order to be able to 
better follow the lecture content. Engineering students 
spend hours trying to understand the material at home. 
You will also need weeks and months of occasional re- 
visiting until the material is sufficiently digested. This is 
the reason for the late final exam. Understanding is the 
biggest problem, and after you understand the material, 
you have to learn little by heart. Remember, you learn a 
lot in biology, and then you understand it. You have to 
understand and practice a lot in engineering, and then 
you learn a bit by heart. 

Problem-solving 

The major time-consumer will be problem-solving. You 
will be confronted with many tasks and problems. The 
more problems you solve, the better you will understand 
how to apply the information you have learnt and the 
better your grade will be. Solve the tasks we have given 
you! If that is not sufficient, search for more tasks in 
textbooks. Also solve the problems set out in past ex- 
ams. It is important not to give up and to embrace the 
intellectual challenge. Try as many methods and strate- 
gies as possible and always look for possible calculation 
mistakes or typos. Messing up the minus-sign and plus- 
sign is quite common. Only if you get completely stuck 
and consultation of the theory no longer helps, then you 
should look up the solution. 

Study groups 

Everyone has times when they get stuck, and the desire 
to give up is strong. Establish study groups of 3-5 peo- 
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ple to explain the issue to each other. More or fewer 
students than this is ineffective. Group members should 
have more or less the same ability level. Share insights, 
knowledge, and understanding of theories, formulas, 
and equations. Collaborative learning is beneficial, and 
you do not stand in competition with each other. How- 
ever, do the work by yourself first to figure out how to 
get started. In groups, some students might be very fast, 
and then you do not learn how to tackle engineering 
problems on your own. Moreover, never end a group 
meeting when one member has still not understood the 
issue. This is a great opportunity to learn and solidify 
your knowledge by teaching. Find ways to achieve under- 
standing. Maybe one has to figure out gaps in previous 
knowledge and then explain this. Each student should 
explain at the end what the problem was and how the 
solution has been obtained. 

Be flexible and chill a bit 

The general recommended sequence is: 


1. Read lecture notes 

2. Read books 

3. Understand sample questions 
4. Do the homework 


Well, not many engineering students do this. Go jovially 
through the script and if you get stuck for more than 
5 minutes, just go on. Forcing yourself to go through 
the script and trying to understand everything step for 
step has disadvantages. You might read too much, sleep 
away, and at the end the questions still confuse you—and. 
time runs out. A better strategy might be to first read 
the questions in the exercise and try to solve them. 


1. Go through lecture notes calmly 

2. Look at the exercise questions and look at what 
you can solve already 

3. Understand sample questions in the manuscript 

4. Understand the manuscript explanations and search 
out textbooks 

5. Iterate! Go back and forth 

6. At the end, try to understand everything including 
the manuscript 


If you do tasks at an earlier point in this sequence, you 
will get stuck for sure. One expects this without under- 
standing the lecture notes. But now, you have a question 
in mind, and it will be easier for you to understand the 
lecture notes. After you have tried the examples, look at 
the sample questions, and if you struggle there, look at 
the lecture notes and books. Only at the end, and when 
you have tried everything, look at the solutions. Wait 
at least one day before you look up solutions. What we 
want to say is that you will have to use an iterative work- 
ing style between example questions and theory. Do not 
be too strict and harsh with yourself. But of course, in the 
end, you should understand everything, the complete 
handout. Also read textbooks or related papers to get 
a consistent view on the issues and connect new knowl- 
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edge with old. This will help you later in the following 
courses and your career, in a way that only looking up 
things related to solving tasks will not. 

YouTube: Nine study techniques for engineering courses 
Education corner: How to study engineering? 


How to study medicine and biology 

Medicine and biology are characterized by a massive 
workload. As a former mathematics or physics student, 
you might not be used to this enormous amount. You 
might think it unnecessary to learn all this as long as 
you understand the underlying laws—but this is not the 
case. For example, immunology is so comprehensive and 
complex that you genuinely need to learn all this stuff 
before you really start to understand how the immune 
system works. No biologist will ever take you seriously if 
you do not catch up and show a decent knowledge base. 
Moreover, your computational models will fail if you 
do not know enough of all the issues and complexity 
around them. Even if you do not model everything, 
knowing the details is nevertheless crucial. Knowing 
more information helps you to guide your modeling 
better. You need excellent time management, reading 
skills, and memorizing strategies to manage this. You 
will have to read much more, and the biology books are 
much thicker. In these disciplines, it is also helpful to 
teach others. Watch the highly recommended advice 
of a graduate of a medical school, and the organization 
skills of a medical student. Their learning strategies are 
impressive. 

YouTube: Medical School: How to study, read, and learn 
YouTube: Watch an organized medical student 


How to watch educational videos 
Watching educational videos is not like watching a Bol- 
lywood movie. 


Learning objectives: take one to two minutes to think 
about what learning goals you have before starting 
a video. Many videos, linked in this handout, help 
you deepen your knowledge, but do not forget to 
make progress. First go through the handout and 
then use the possibility to go deeper. Plan your 
learning. 


Pause and ponder: if you were not concentrating for a 
moment or you missed the point, rewind or push 
the stop button. 


Speed adjustment: if you can, speed up or slow down 
the video for your convenience. Double-speed? 
Why not? 


Take notes: you cannot ask questions immediately. Jot 
your thoughts down and keep them for the lecture 
in the classroom. Apply the Cornell note-taking 
system: the upper left column (1/3) of your sheet 
is reserved for questions and keywords. The right 
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column (2/3) is used for your notes as usual. At the 
bottom of your sheet is a summary section (5cm). 
Fill the left column and the summary section in 
within 24 hours of taking your notes. It will help 
you reflect on the content. 


Avoid distractions: keep distracting devices like iPods 
and smartphones away. 


Watch in small pieces: if you watch everything at once 
for long periods, it is less efficient than spreading 
the sessions over time. Watch a video every now 
and then. 


Enjoy with peers: you might use the opportunity to dis- 
cuss the content with others so you can learn from 
each other. 


YouTube: Cornell notes method 


Self-directed learner and critical thinking 
Learning habits are set out in stages, as shown by Grow’s 
levels of self-directed learning [5]. 


Stage 1 (Dependent learner): Relies on instructor. No 
self-direction. Task-oriented. 


Stage 2 (Interested learner): Not always directed. Seeks 
some opportunities and sets some goals. 


Stage 3 (Involved learner): Ability to learn individu- 
ally. Has learning goals and methods to achieve 
those goals. 


Stage 4 (Self-directed learner): Sets goals. Knows how 
to assess and how to self-motivate. Finds valid and 
reliable resources. 


YouTube: Self-directed learning (Part 1) 
YouTube: Self-directed learning (Part 2) 


To become a self-directed (self-regulated, lifelong) learner, 
you must learn to assess the demands of the task, evalu- 
ate your previous knowledge and skills, plan your ap- 
proach, monitor the progress, and adjust the strategy 
if needed [3]. Planning the learning process is a step 
which is frequently ignored, and the time required for 
learning to take place is often underestimated. Ponder 
on why you take a certain approach and not another 
one. Also think about what was ineffective last time 
and how this can be improved in the future. Self-critical 
evaluation is important to avoid directing yourself mean- 
inglessly. Keep in mind what Karl Popper! said: "If we 
are uncritical we shall always find what we want: we 
shall look for, and find, confirmations, and we shall look 
away from, and not see, whatever might be dangerous to 
our pet theories". Wisdom and the best approximation 


1 Austrian and British philosopher Sir Karl Raimund Popper 1902— 
1994. 


of truth come only if you are your most merciless but 
constructive critic. It is the right but the hardest way. 
Also watch the lecture series on critical thinking, which 
will help you to become a better scientist. 

YouTube: Critical thinking 

Ina book based on the work of the Foundation for Critical 
Thinking we found the following definition by Francis 
Brown”: "Critical thinking is a desire to seek, patience to 
doubt, fondness to meditate, slowness to assert, readi- 
ness to consider, carefulness to dispose and set in order; 
and hatred for every kind of imposture." 


Repetition or elaboration strategies 

Repetition does not have a significant measurable impact 
on learning. Repetition is the consolidation of something 
but this does not mean you are consolidating something 
useful or correct. Misconceptions can be consolidated as 
well. Thus, feedback from peers and the teacher is im- 
portant. Much more effective is deliberate practice which 
directly targets self-identified weaknesses and requires 
a healthy portion of self-criticizing and critical thinking. 
Additionally, repeating easy tasks does not help you to 
become better. Search for challenges and practice annoy- 
ing or difficult tasks with attainable goals. Moreover, a 
better strategy than repetition is elaboration. Elabora- 
tion deals with the integration of new pieces of informa- 
tion into your existing network of knowledge organiza- 
tion. Elaboration is more effective with high self-activity 
(d = 0.7) rather than letting the teacher do it for you 
(d = 0.44). Make connections to your previous knowl- 
edge instead of repeating facts alone and search for tasks 
which challenge you. 


Approach to dealing with mistakes 

Learning something new opens up space for opportu- 
nity, and if you dare to learn something new, mistakes 
will happen. The more mistakes you make, the more you 
will learn in the long term. Embracing new challenges 
and thus taking the risk of failure will carry you farther 
than avoiding challenges to avoid mistakes (d = 0.44). 
This strategy might lead to problems in the learning pe- 
riod (d = —0.15) but result in better performance after 
the learning period (d = 0.56). This approach is effective 
if the test is similar to the practice tasks (d = 0.2) and 
superior in applying the learned facts to new problem 
types (d = 0.8), which will help you to get even more 
out of this course in the future. Inaccurate prior knowl- 
edge or even misconceptions (the heart oxygenates the 
blood, Pluto is a planet, objects of different masses fall at 
different rates, blind people hear better) are difficult to 
repair if the teacher is unaware of them before the exam. 
Be considerate toward others making mistakes, and do 
not fear embarrassing moments yourself. Your only duty 


2 English philosopher, scientist, jurist, statesman, and author Fran- 
cis Brown 1561—1626. Seen as father of empiricism and scientific 
methods. 
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is to learn from mistakes in order to improve your work. 
Careless and deliberate sloppiness has nothing to do 
with it and is not appreciated. 


ACQUIRE 


Component 
skills 


Figure 5. Elements of mastery. Source: [3]. Copyright © 
2010, John Wiley and Sons. 


Knowledge and skill levels 

Knowledge falls into several types [3]. Declarative knowl- 
edge describes the knowledge of facts and concepts that 
can be stated or declared. Procedural knowledge is 
knowing how to apply various procedures, methods, 
theories, and styles. Contextual knowledge describes 
the ability to know when something has to be applied 
and conceptual knowledge says why something is ap- 
propriate in a particular situation. See also Figure 5 for 
the stages of mastery and Bloom’s Taxonomy in the ap- 
pendix [6] (Fair Use) for the classification of thinking 
skills. 

We not only have different knowledge types, but this 
knowledge is also organized in different ways. The knowl- 
edge organization of beginners shows few connections 
between elements and looks like separated knowledge is- 
lands or a linear sequence of knowledge pieces, whereas 
experts’ knowledge is densely connected—for example, 
in a hierarchical or network form. History facts might be 
memorized along a timeline, but if the question requires 
knowledge organized along other criteria, or the chain of 
knowledge is interrupted, the knowledge might be not 
accessible. Mind maps might be a good possibility of 
connecting pieces of knowledge in different ways. Com- 
petence can also be classified into four different stages, 
as shown in Figure 6. In the beginning, it is impossi- 
ble to know what one has never learned before. After 
a while, one recognizes knowledge gaps and fills them 
until the acquisition process and origin get lost. Profes- 
sors are frequently in the top competence level and may 
find it difficult to identify the problems with which you 
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struggle. Participating during lectures and explaining 
questions clearly can help your supervisor to help you 
become better. 


Unconscious 
competence 


Conscious 
competence 


Conscious 
incompetence 
Unconscious 

incompetence 


Figure 6. Competence levels. Source: [3]. Copyright © 
2010, John Wiley and Sons. 


Summary 

Effect of educational strategies [2] 

d=1.81 Self-efficacy belief 

d = 1.39 Preparation and planning by the lecturer 

d= 1.35 Clear and understandable lecturer 

d=1.13 Deliberate practice objectives 

d=1.12 Clear learning objectives 

d=0.90—0.98 Attending courses regularly 

d=0.77 Openness to outsider opinions 

d=0.75 Student’s regulation of effort 

d =0.68 Group work 

d=0.68 Empathy and warm-heartedness of the teacher 

d = 0.67 Previous skill level 

d=0.65 Teacher-student relationship quality 

d =0.64 Co-operative learning 

d =0.57 Teacher’s enthusiasm 

d=0.49 Diligence 

d=0.48 Motivation 

d =0.47 Supportive atmosphere in the classroom 

d=0.47 Encouragement of learning 

d =0.43 Solely making notes during oral presentations 

d=0.41 Addressing learning progress 

d=0.41 Organized learning 

d=0.41 Learning with fellow students 

d=0.41 Time management 

d=0.34 Disturbances during lessons 

d =0.32 Critical thinking 

d = 0.32 Intrinsic motivation 

d=0.21 Class size 

d =0.21 Compulsory attendance 

d=0.19 Co-teaching 

d= -—0.02 Making notes during PowerPoint presenta- 
tions 

d= -—0.43 Fear exams 

Disturbing effects on exam results variability is minor 

(n=403623 students, 911 effects investigated): 

6% Procrastination 

5% Diligence 

4% Fear of exams 
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4% Intelligence 

3% Emotional intelligence 
2% Socio-economic status 
<2 % Biological age 

<2 % Gender 

<2 % Extroversion 

<2 % Self-esteem 

<2 % Social support 

<2 % Stress 

<2 % Depression 


5. Learning checklist 


Check boxes if appropriate. It might help you not to 
forget things and might inspire you to promote your 
self-directed learning. Try to stay within the script and 
do not lose too much time by finding answers. Use it 
as an inspiration and organization tool. You should ask 
yourself the following questions [7]: 

e What is it and how is it defined? (declarative) 

e How is this theory applied in the real world? (pro- 
cedural) 
Could you provide an example of when this for- 
mula might be used? (contextual) 
Could you sketch what that (solution, device, etc.) 
might look like? (procedural) 
How is this equation applied in practice? (proce- 
dural) 
Where did that formula come from? (conceptual) 
Do you understand when that formula is used? 
(contextual) 
Remember [3]: Declarative knowledge describes the 
knowledge of facts and concepts that can be stated or de- 
clared. Procedural knowledge is knowing how to apply 
various procedures, methods, theories, and styles. Con- 
textual knowledge describes the ability to know when 
and in which context something has to be applied and 
conceptual knowledge says why something is appropri- 
ate in a particular situation. Does the concept fit your 
application? Do you know the concept behind a certain 
definition? 
Another definition can be found (trainingindustry.com): 


Definition 1. Conceptual knowledge refers to the knowl- 
edge of, or understanding of concepts, principles, the- 
ories, models, classifications, etc. We learn conceptual 
knowledge through reading, viewing, listening, experi- 
encing, or thoughtful, reflective mental activity. 


Definition 2. Declarative knowledge refers to facts or 
information stored in the memory, that is considered 
static in nature. Declarative knowledge, also referred to 
as conceptual, propositional or descriptive knowledge, 
describes things, events, or processes, their attributes, 
and their relation to each other. It is contrary to procedu- 
ral, or implicit knowledge, which refers to the knowledge 
of how to perform or operate. 


Definition 3. Procedural knowledge refers to the knowl- 
edge of how to perform a specific skill or task, and is 
considered knowledge related to methods, procedures, 
or operation of equipment. Procedural knowledge is 
also referred to as implicit knowledge, or know-how. 


Definition 4. Implicit knowledge is knowledge that is 
gained through incidental activities, or without aware- 
ness that learning is occurring. Some examples of im- 
plicit knowledge are knowing how to walk, run, ride a 
bicycle, or swim. 


Example: 

The determinant of a 2-by-2 matrix is the area between 
two linear independent vectors (declarative). It can be 
computed in the following ways (procedural). The de- 
terminant is useful to understand whether a matrix is 
invertible (contextual) and only works if the matrix is a 
square matrix (contextual). The determinant is based on 
the geometric intuitions and concepts of linear algebra 
in the following way (conceptual). 
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Part 1: Biochemical network in the matrix form 
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Part 2: Metabolic modeling 
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Part 3: The magic of change and how to find it 
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Part 4: Physical modeling and nonlinear enzyme kinet- 
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6. Further reading 


We suggest reading the great article by Barabasi et al. to 
familiarize yourself with the topic of network medicine 


[8]. 
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Recall /regurgitate facts without 


understanding. Exhibits previously 


learned material by recalling facts, 


terms, basic concepts and answers. 


To show understanding finding in- 
formation from the text. Demonstrating 


basic understanding of facts and ideas. 


>< 


To use in a new situation. Solving 
problems by applying acquired knowl- 
edge, facts, techniques and rules ina 
different way. 


To examine in detail. Examining 


and breaking information into parts by 
identifying motives or causes; making 
inferences and finding evidence to sup- 


port generalisations. 


HIGH LEVEL THINKING SKILLS 


To change or create into some- 

thing new. Compiling information to- 
gether in a different way by combining 
elements in a new pattern or proposing 
alternative solutions. 


To justify. Presenting and defend- 

ing opinions by making judgements 
about information, validity of ideas or 
aa of work based on a set of crite- 
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Can you explain what is happening... 
is meant...? 


Can you list three ...? 

Can you recall ...? 

Can you select ...? 

How did happen? 
How is ...? 

How would you describe ...? 
How would you explain ...? 
How would you show ...? 
What is ...? 


When did ...? 
When did happen? 
Where is... ? 
Which one ...? 


Who was ...? 
Who were the main... ? 
Why did ...? 


How would you classify the type of ...? 


How would you compare ...? 


?contrast ...? 


How would you rephrase the meaning ...? 
How would you summarise ...? 

What can you say about ...? 

What facts or ideas show ...? 

What is the main idea of ...? 

Which is the best answer ...? 

Which statements support ...? 

Will you state or interpret in your own 


words ...? 
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what 


How would you use...? 
What examples can you find to ...? 


How would you solve using what 
you have learned ...? 
How would you organise to 


show ...? 

How would you show your understanding 
Of 2? 

What approach would you use to...? 

How would you apply what you learned to 
develop ...? 

What other way would you plan to ...? 
What would result if ...? 

Can you make use of the facts to ...? 
What elements would you choose to 
change ...? 

What facts would you select to show ...? 
What questions would you ask in an inter- 
view with ...? 


What are the parts or features of ...? 
How is related to ...? 

Why do you think ...? 

What is the theme ...? 

What motive is there ...? 

Can you list the parts ...? 

What inference can you make ...? 
What conclusions can you draw ...? 
How would you classify ...? 

How would you categorise ...? 


Can you identify the difference parts ...? 


What evidence can you find ...? 
What is the relationship between ...? 


Can you make a distinction between ...? 


What is the function of ...? 
What ideas justify ...? 
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What changes would you make to solve...? 
How would you improve ...? 

What would happen if...? 

Can you elaborate on the reason...? 

Can you propose an alternative...? 

Can you invent...? 


How would you adapt to create a 
different...? 

How could you change (modify) the plot 
(plan)...? 


What could be done to minimise 
(maximise)...? 

What way would you design...? 

Suppose you could what would 
you do...? 

How would you test...? 

Can you formulate a theory for...? 

Can you predict the outcome if...? 

How would you estimate the results for...? 
What facts can you compile...? 

Can you construct a model that would 
change...? 

Can you think of an original way for the ...? 
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What is your opinion of...? 

How would you prove/disprove...? 

Can you assess the value/importance of...? 
Would it be better if...? 

Why did they (the character) choose...? 
What would you recommend...? 

How would you rate the...? 

What would you cite to defend the ac- 
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How would you evaluate ...? 
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What choice would you have made...? 
What would you select...? 

How would you prioritise...? 

What judgement would you make about...? 
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Chapter 1: Biochemical networks in the matrix form 


Thomas Sauter, Marco Albrecht 


Motivation 

The biochemistry of the cell is very complex and the available data might overwhelm the abilities of interpretation 
[1]. Reductionist approaches, combined with some intuition, have brought us far, but we need rational approaches 
to better understand the interplay of molecules at the system level. We have to check whether a hypothesis is in 
itself logical and can be aligned with data. In this chapter, we—while reducing biochemical molecules to their 
function—will learn how to interlink several players to acquire a mechanistic understanding of a pathway or a 


complex system. Modeling thereby helps us in the following ways [2]: 


1. Enhancing understanding of otherwise unintelligible systems 
2. Requiring a way of thinking that can be beneficial to the design of experiments 


While, by studying this chapter, you will not become a computational scientist, it will help you to communicate 
with them. Nobody expects that you understand everything immediately. It will take time to digest and it requires 


a lot of practicing to build the skills. 


The mathematical principles introduced here will be applied to biological pathways and networks in the following 
chapters. If you prefer, you could directly jump to Chapter 2 and 3 to see some applications first. 
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1. Lecture summary 


1.1 M Biological networks and graph theory 
Biological phenomena are very complex and systems 
biology helps us to understand their system’s behavior. 


Definition 1. Systems biology is the science that studies 
how biological function emerges from the interactions 
between the components of living systems and how these 
emergent properties enable and constrain the behaviour 
of the components [2]. 
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Figure 1. The interaction graph of a signal transduction 
pathway triggered by the inputs I and I2 with subse- 
quent response in the output layer indicated by O1 and 
O2. 


Put in another way: we are not interested in dissect- 
ing objects into ever smaller parts and details. Instead, 
we look at the elements (nodes, states) we know and 
focus on their interactions (edges, coefficients). The in- 
terplay of a few elements can result in manifold different 
phenomena and observations, depending on how the 
elements activate or inhibit each other. The interactions 
between the states in a system are often represented as a 
matrix. But the type of matrix used can vary for different 
systems. 


https: //doi.org/10.11647/OBP.0291.01 
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Example 1: Signal transduction 


The example graph in Figure 1 can be written in 
the form of an incidence matrix In 


t+ o- t+ + + + = + + + + + + 
f-1 -1 0 0 0 0 0 0 60 0 0 0 oj 
0 0 -! 0 0 0 60 60 0 0 0 0 0} 2 
0 0 0 -! 0 0 1:0 0 0 0 0 0} 4 
1 0 0 1 -1 06 0 0 0 0 0 0 0] .B 
ec 0 0 0 0 1 -1 0 1 0 0 0 -1 Oo} © 
oo 0 0 0 0 0 1 -1 0 0 0 0 0 0} D 
o 1 1 0 0 0 60 -1 -1 0 0 0 O|]F 
0 0 0 0 0 60 0 60 1 -1 1:0 Of} F 
0 0 0 0 0 0 60 60 0 1 -1 0 -1}6 
0 0 0 0 0 0 0 0 0 0 0 1 0 }or 
0 0 0 0 0 0 0 60 0 60 0 0 1 }o2 


with interactions as column entries and states as 
row entries. 


Example 1 mimics a signal transduction network 
within a cell (Figure 1), where relevant molecules are 
represented as nodes (states) and interactions as edges 
(later resulting in mathematical terms in the balance 
equations). The states represent the phosphorylation sta- 
tus or the concentration of a particular molecule, while 
the interactions represent binding affinities, regulatory 
interactions, or metabolic fluxes etc. If molecule A is 
directly responsible for a higher activity or abundance 
of molecule B, we draw an arrow from A to B, which is 
called a directed edge. Moreover, we write in the related 
column of the incidence matrix In (see Example 1) the 
number —1 for A and +1 for B. Molecule B, on the other 
hand, has a positive impact on C. Molecule A would 
thus indirectly lead to higher levels of activation of C, 
but no direct interaction, so this is not represented in 
the network. Edges can also represent inhibitory interac- 
tions, which are drawn as a straight line with a transverse 
line at the inhibited molecule. The true interactions can 
be figured out through experimental studies or via the 
analysis of the overall behavior of a network. Biologi- 
cal systems can also be represented in the form of an 
adjacency matrix A or as an adjacency list L, tackled in 
Case Box 1 and 2. The combination of elements and 
interactions makes up a graph or network. Protein inter- 
actions can be represented as undirected networks. One 
valuable source for such networks is, for example, the 
STRING database [3]. 
We want to compare a directed network with an undi- 
rected network by reference to the cases in Figure 2. We 
note down the related matrices in Case Box 1 and 2. 


ABC A-*+B-8sC 
g a4 c wee [ 

F e e d 
Figure 2. Left: undirected graph. Right: directed graph. 
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Graphs are a special case of more general hypergraphs, 
shown in Figure 3. In a graph, edges connect 2 nodes, 
whereas in a hypergraph H = (V,E) there is a set of hy- 
peredges E connecting a set of vertices V. In other words, 
in a hypergraph, a hyperedge can connect any number 
of vertices. Hypergraphs are used, for example, used to 
represent metabolic networks where reactions can con- 
nect multiple substrates and products and sometimes 
involve cofactors. Undirected hypergraphs represent set 
systems, as shown in Figure 4. Directed hypergraphs 
have hyperedges e = (S, K) with vertices assigned to the 
tail/start knot S and vertices assigned to the head/end 
knot K. An example directed graph is shown in Figure 5. 
One of the hyperedges points from the tail knots A and B 
(S = {A, B}) to the head knots C and D (K = {C, D}) writ- 
ten as e; = ({A,B}, {C, D}). Pay attention to the brackets. 


Case 1: Undirected graph 


An undirected graph is described by vertices V = 
{A,B,C,D,E,F} and edges E = {a,b,c,d,e, f,g} = 
{(A,B), (B,C), (C,D), (DE), (DF), (C.F), (FA)}. 
The relevant matrices (see text) are: 


A BC DE F 
0 10 0 0 IjA B,F |A 
101 0 0 O;B A,C |B 
A=]0 101 0 1/C, L=|B,D,F\C, 
001 0 1 IID C,E,F|D 
0 0 0 1 0 O;F D |é—E 
1 01 1 0 O|F A,C,D| F 
abedef g 
100 0 0 0 I}jA 
1 10 0 0 0 O}B 
In=|0 1 1 0 0 1 O/C 
0 0 1 1 1 0 O|D 
000 1 0 0 O;F 
000 01 1 I/F 


whereby the adjacency matrix A is symmetric. It is 
not symmetric for a directed graph. 


Case 2: Directed graph 


The directed graph is represented by vertices V = 
{A,B,C, D,E, F} and by edges E = {a,b,c,d,e, f,g} 
= {(A,B), (B,C), (D,C), (D,B), (F.D), (C.F), (F,A)}.- 
The order of vertices in the edge description is rel- 
evant now. The matrices are: 


A BC DE F 


0 10 0 0 OjA BIA 

0 0 1 0 0 O;B C |B 

A=]0 00 00 1/C, L=| F /C, 

0 0 1 0 1 OD C,E|D 

0 0 0 0 0 O;F —|E 

100 1 0 O|F A,D| F 

a b c dee fg 

-1l 0O 0 0 0 0 1|A 

1 -1 0 0 0 0 0 |B 

In= | 0 1 1 0 Oo -1l O|;C 

0 Oo -1 -1 1 0 0 |D 

0 0 0 1 0 0 OE 

0 0 0 QO -l 1 -—-I/F 
hypergraph substrate graph bipartite graph 
A,  Ac*c AY LC 

< coms a a 
B D B—D B D 


Figure 3. A hypergraph can be translated into a substrate 
graph or a bipartite graph. A substrate graph cannot 
be converted back to the hypergraph because the infor- 
mation whether A AND B are consumed by the same 
reaction is lost in the substrate graph. It could also be 
possible that A is converted into C and D, and that B is 
transformed into C and D by independent reactions. 


V7 
Undirected hypergraph: v= 


Figure 4. 
{v1,¥2,V3,V4,V5,¥6e,v7} and E = 
{{v1,v2,v3}, {v2, v3}, {¥3,V5,V6}, {va} f- 
en.wikipedia.org /wiki/Hypergraph, Fair Use. 


{e1,€2,€3,e4} = 
Source: 
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A CSE 


= 
e3 


B D 


Figure 5. Directed hypergraph V = {A,B,C,D,E}, 
E = {e1,e2,¢3} = {({A, 8}, {C,D}), ({C}, {EZ}), 
({E,D}, {B})}. 


Example 2: Simple hypergraph 


The example in Figure 5 can be represented by the 
incidence matrix: 


el e2 e3 
—1 0 OA 
—-l1 0O 1/B 
es. 1 a he 
1 0 -—-1/D 
0 1 -1/E 


Another representation of the system in Figure 5 
can be realized with chemical reaction equations: 


A+B -=4C+D 
C248 


D+E—=>B 


Directed hypergraphs, as mentioned previously, are 
needed for an important field in systems biology: metabolic 
network modeling. The only difference is that we have 
additional stoichiometric information (coefficients) to 
weight the edges. 
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1.2 Mi Modeling of metabolic networks 

Metabolic networks describe the flux of metabolites within 
a system. The fluxes are controlled by enzymes. A sim- 
plified metabolic network is shown in Figure 6 and Ex- 
ample 3. A more elaborate example is shown in Figure 7 
and Example 4. 


Example 3: Simple metabolic network 


We see in Figure 6, that we have two possible path- 
ways here, either via v2 or v3. All edges can be 
weighted according to the stoichiometric coeffi- 
cients so that we do not simply have an incidence 
matrix (all the entries are 1s or 0s) but a stoichio- 
metric matrix: 


Figure 7. A metabolic model with 2 inputs and 3 outputs, 
whereby Molecule B is both. The system is controlled 
by the transport reactions of the external Metabolites A 
and B to the cell, which releases Product P but can also 


i te ea release B and E. 


| 1 -1 -1 0 |Ann 
N=/0 1 1 -1]Bp. , 
—[ - o & BA. Example 4: Larger metabolic network 
0 0 0 1 Bout 


Here, we see several possible routes (Figure 7) and 


where the reactions determine the columns and also a hyperedge vio. It is quite difficult without 


the metabolite concentration determines the row 
entries. Often we focus on intracellular metabolites 
only, so that we can reduce the system in this case 


mathematics to understand and predict the fluxes, 
which we will learn in the next learning block. For 
now we want to set up the stoichiometric matrix: 


to: Vi v2 V3 V4 (V5 Ve V7 vg vo VIO 


1 0 0O Oo -1 —-1 -!1 O 0 OA 
V1 i) V3 V4 0 1 0 0 1 0 0 —1 —l1 O0O;B 
N=/1 —-1 —-1 O]A N=j0 0 0 0 0 1 0 +1 O —-I1;C 
0 0 O 0 0 0 1 0 0 -—-1|D 
1 1 —-1/1B 
0 00 0 -1 O 0 0 0 0 1|E 
: ‘ : 0 0 -1 O 0 0 0 0 1 1|P 
This example has only unimolecular reactions. ; : 
P y with the reversible reactions: 
rev = {R2,R8} 


and irreversible reactions: 
irrev = {R1,R3,R4,R5,R6,R7,R9, R10} 


Me et ee il After seeing some motivating examples of biological 


Figure 6. A simple model of a metabolic system. The gray Networks and the possibility of representing these as 
dotted line represents the system boundary. A molecule  ™atrices, we will now revise some basic mathematical 
comes from outside to inside and turns into internal Concepts of linear algebra and matrix calculation. 
Molecule B via two possible reaction ways. Molecule B 

is leaving the system. 


2. Basics of Mathematics 


2.1 M@ Linear algebra 

Linear algebra (Arabic: al-jabr) is one of the most funda- 
mental and helpful topics from the realm of mathematics. 
We recommend watching the following YouTube videos 
and more from that channel the channel the videos be- 
long to, by clicking on the link below: 

YouTube: The essence of linear algebra 

YouTube: How to read math? 


@ From a set of equations to a matrix 

In science and technology, we frequently encounter sets 
of linear equations. One equation might be 3x; +x2 = —2, 
which we can write in a general form with a11x) +a)2x2 = 
b, with coefficient a}; = 3, aj2 = 1 and constant b; = 
—2. The variables, x, can represent molecule concentra- 
tions, and coefficients can represent interactions between 
molecules. If we add at least one other linear equation— 
such as 2x; + 1x2 = 0—with at least one common variable, 
we have a linear equation set: 


aX + a12x2 = by 


421X1 + 22x27 = bz 


You see that the first index m of coefficient a,,, increases 
with the number in rows, while the second index n in- 
creases with the number of the variables x,,. The example 
equation set can be solved for x: 


3xj tx. =—2 © xX =—-2-—3x 


2x, +x2 =0 x= 2x 

which can be geometrically interpreted as shown in Fig- 
ure 8. The solution is the point x; = —2 and x. = —4, 
where the vectors cross. You either get one solution, no 
solution, or an infinite number of solutions for any linear 
equation set. The general form, with m equations and n 
variables, is then: 


ay1x1 +a 42x2 + +4nXn = d, 
a2\X| +a22x2 + +4nXn = bo 

(2.1) 
Ami X1 + An2X2 + +dmnXn = bm 


which can be written in a much denser form as: 
Ax = b 
with a matrix in a bold capital letter: 


a\1 a20°"" ain 
a2) 4220 =«°"" Q2n 


Ami Am2 Amn 


Chapter 1: Biochemical networks in the matrix form — 5/24 


: “4 3 2 A 0 1 2 
zy 


Figure 8. A linear equation system as vectors. The solu- 
tion of the equation system is the cross-section of both 
vectors. 


and vectors in bold lowercase letters: 


xX] by 

x2 bo 
x= and b=]. 

Xn bin 


This is very convenient and compact. The solution is 
simply x = A~'b. The exponent —1 indicates the matrix 
inverse and will be explained later. The matrix A basi- 
cally describes n arrows with the arrow tail in the origin 
(zero-point), and the arrowhead on m coordinates in a 
space spanned by the coordinate system. The equation 
set 2.1 describes an inhomogeneous system. It becomes 
a homogeneous system if b = 0. If the system has a 
solution we have a consistent system, otherwise it is 
inconsistent. A linear equation system is also fully de- 
termined by the augmented coefficient matrix: 


a, a2 ++ Ain | DY 

a2} 422, *** an | bp 
(A|b)= 

Ami m2 Ann | Om 


YouTube: From an equation set to a matrix 


@ Simple matrix operations 

Let’s say we have a matrix B and multiply it by a matrix 
A from the left. What does it mean geometrically? It 
means that we transform the system B in a way that can 
be a rotation, a scaling, or any form of linear deformation 
of the space, which is spanned by the coordinate system 
of B. If the matrix B has p arrows pointing toward n 
coordinates within a n dimensional space, the multipli- 
cation of B by A from the left transforms the coordinate 
system from n dimensions to m dimensions. The final 
matrix C represents p arrows pointing to the new coor- 
dinates in a m dimensional space. Have in mind that the 
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order of matrix multiplication matters in contrast to the 
multiplication with numbers. Because A transforms the 
coordinates of B from one space to another, the number 
of columns in A must equal the number of rows in B in 
order to have sufficient coupling of two spaces for the 
transformation or multiplication AB = C. 


caf} 


Division by a matrix does not exist, but division of a 
matrix by a scalar is possible. 
YouTube: Multiplying matrices 


Example 1: Multiply matrices 


A more detailed scheme is: 


4x2 matrix 4x3 matrix 


a1, 412 2x3 matrix * C12 C43 
bir by3} 
431 432) |: bo ba - ©3233 


where two c elements are calculated as follows: 


C12 = a1 b12 + aj2b22 


€33 = 43113 + a32b23 


What is the sum of two matrices geometrically? The 
matrix A has n arrows originating from the coordinate 
origin point to the m coordinates. If we sum up with 
matrix B, the n arrows of B start from the coordinates of 
A and land on the coordinates A+ B. This is equivalent 
to having n arrows starting from the coordinate system 
origin and pointing to the coordinates described by A + 
B. 

YouTube: Sum up matrices and scalar multiplication 


Example 2: Sum and subtract matrices 


The sum or subtraction of matrices with identical 
size is calculated entry-wise: 


1 3 Mh aes OOP le 3+0 145 
1 0 0 7 5 O} [147 045 0+ 


fl 3 6 
~{8 5 Of” 


The scalar multiplication is also a transformation of the 
matrix. If you multiply a matrix by a number, you scale 
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the matrix by this number without skewing or rotating it. 
The scalar multiplication by two doubles all coordinate 
values the arrows point to. Do not confuse this with 
scalar product, which is a form of inner product! 
YouTube: Scalar multiplication 


Example 3: Scalar multiplication 


Another important operation is transposition (to inter- 
change columns with rows). 
YouTube: Transpose a matrix 


Example 4: Transpose 


tt 2. 3 4 9 
6 7 8 9 10 


M=\ oe 9g 46 a. 15 
16 17 18 19 20 
1 6 11 16 
2 F. 12-17 
M=| 3 8 13 18 
4 9 14 19 
5 10 15 20 


@ Square matrices 

Square matrices have as many rows as columns. Some 
square matrices are especially secure. One example is 
the matrix in diagonal Jordan form, which was the aim 
of the Gauss-Jordan method: 


a{1 see 0 
A = 
QO  ... Ann 
Multiplying a diagonal matrix multiple times from the 


left is the same as using the number of multiplications 
as exponents of the diagonal elements: 


ayy... O ayy... O ayy... O 
A’B= |: |]: |: : |B 
QO...) Ann QO... Ann QO  ... Ann 
ay 0 
=|: : |B 
er 


nn. 


This is much better than multiplying non-diagonal ma- 
trices, which can be very frustrating after a while. A 


very important special form of the diagonal matrix is the 


identity matrix, with which has 1s as diagonal elements: 


YouTube: Identity matrix 
Also frequently mentioned is the zero matrix: 


0... O 
O=: |: : 
0... O 


One characteristic specifically of a square matrix is the 
trace. The trace is the sum of the diagonal elements: 


n 
tr(A) =} ay = ay +022 +++ +4 
i=l 


YouTube: See also the symmetric matrix 


@ Determinant 

The determinant gives the area in a 2-by-2 matrix and 
the volume in a 3-by-3 matrix. What does it mean when 
the determinant is equal to zero for a 2-by-2 matrix? It 
means that the area of the matrix is equal to zero and 
therefore the vectors that compose the matrix are linear- 
dependent. In other words, it means that the vectors 
have parallel directions. From high school, we know that 
we can describe the position of any point y of a line ina 
function as an Origin O and a constant (c) that multiples 
a non-zero vector v (y= c-v+0). For a plane and space, 
any point can be described as a linear combination of two 
independent vectors, respectively. Consequently, if the 
two vectors are linear-dependent, we are no longer able 
to describe any point in the plane, but only the points 
that are situated on a line that is parallel to the vectors. 
For 3-by-3, a determinant of zero indicates that at least 2 
of the 3 vectors are linearly dependent and therefore only 
the point located on a plane can be described by this set 
of vectors. More generally, a matrix with a determinant 
of zero describes a transformation of the system that 
reduces its dimensions by 1. It is possible to collapse 
a system to lower the number of dimensions, but the 
opposite is not possible. Therefore, the inverse of matrix 
A with the determinant of A equals zero, which would 
geometrically result in an expansion of the system to a 
higher number of dimensions. This is not possible. 


det(4) =|"? 


[= ad ~ be 


One possibility for calculating a determinant is via the 
initial reduction of the matrix. Larger matrices can be 
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split into smaller matrices with the Laplace expansion: 


a bed 
e f gih 
i j kl 
m no p 
f gih e gh 
=+a:|/j k I\)—b-}i k 1 
no p mo p 
e fh e fg 
+c j lj-d-ji j ek 
n p m n oO 


Please pay attention to the alternating signs (+/-). 
YouTube: Laplace expansion or cofactor expansion 
Laplace expansion can be coupled with the Gauss method, 
as shown in Example 5. 
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T T T T T T T T T T 
=a 0 1 2 3 A 0 1 2 3 


Figure 9. Geometrical interpretation of the matrix inverse and determinant impact. Left: |A| = 1. Right: |A| = 3 [4]. 
Copyright © 2021, Stack Exchange Inc, Licence: CC BY SA. 


Example 5: Determinant after reduction with Gauss ™ Inversion 
An invertible matrix A has linear-independent rows and 


columns. A matrix B is uniquely determined by A, if one 


1) 13 2 —6 jl 3 2 —6 has the symmetric and invertible matrix: 
de(Ay- U-D fl 2 2 -|_Jo -1 -4 1 
: WH 21): 2 8 OG SG AB=BA=I 
IV—2I) |2 4 6 9 0 -2 10 3 


with identity matrix I. 
The second row minus Row I, and Row III and IV 
minus 2 times the first row, gives the right matrix. 
Laplace expansion (see later) gives us the lower 
right matrix block. Further reduction and an ad- 


Example 6: Inverse interpreted geometrically 


Geometrically one can interpret the matrix [4] 


ditional isolation of the lower right block result in: R 1] 
ai = 
=i -l -4 1] 14 -1 e 4 A= fi | with related vectors 
io |2 26 4S10 S. 2 ale i a=(1 1] 
Mm=21) |-2 —10 3) jo —2 1 
=2.1-1.(-2)=4 as an area in a 2-dimensional (2D) coordinate sys- 
tem shown in Figure 9 left in red. The shape of the 
inverse in blue is rotated by 90 degrees and is small 
A reduction is not always possible. The determinant in the directions where A is large. The vector a’ is 
of a 3-by-3 matrix can be obtained with the the rule of at right angles to a, and a! is at a right angle to ap. 
Sarrus,' where the first two columns can be written be- The determinant |A| describes the area in 2D and 
side the determinant to facilitate the optical assessment the volume in 3D. Here, the determinant |A| = 1 is 
of the diagonal product: one, and the area is preserved. If we change the 
matrix 
a b cl a b a bc] a _b 
de f\ de d & f(a e 4 a=([2 1 
g h il eh g h i] g A A= i i with related vectors 
abe ay=[1 2] 
de f = teeith sf gte dy) to get another determinant |A| = 3, we see that the 
gh a) emma tamer rey area is changed. 


Determinants can also be solved efficiently with the LU We use Cramers rule: 

decomposition explained in Example 10. ee det(A;) _ |Ai| Ae: 
YouTube: The determinant det(A) —|A| 

with the site determinant |Aj| to find the inverse analyt- 

1 French mathematician: Pierre Frédéric Sarrus (1798—1861). ically. The cofactors of the Laplace expansion can be 


saved in the so-called matrix of cofactors or comatrix C. 
Its transposed version C’ is the adjugate matrix: 


Ci Car ee. Cui 
1 1 1 [C2 Co ... Cr 
=i T . 
= —_—C = —_adj(A) = — ; 
|A| |A| |A| : 
Cin Con tee Cnn 


Inversion of a 2 x 2 matrix 
ae bb) 1 fd -b]. 1 ad 
[ce d} — det(A)|-e a} ad—be|-c a 


Inversion of a 3 x 3 matrix 


| 
A=|d eJ)f A= — oC 
|A| 


gh i 

e f d f de 

- hi - i +( h 

bec ac a b 

o> ae g i ie dH 

ds c) _|@ ¢| | (a b 

e f| \ld f de 
+(ei— fh) —(di-—fg) +(dh—eg 
C=|-(bi—ch) +(ai-—cg) ah — bg 


cT = |-(di- fg 


YouTube: Inverse of a matrix 
YouTube: Solve a linear equation set with the inverse of 
a Matrix 


@ The rank 

Often, matrices derived from linear equation sets can be 
reduced, as not all equations are necessary to describe 
the system. We often search for the minimal matrix, 
which is described by the number of linear-independent 
rows or columns. Linear- independent rows and columns 
are sets of rows or columns where none of the rows or 
columns is a linear combination of the others (example 
of linear dependency: Row 1 equals to the sum of Rows 
2, and Row 3 or Column 3 is three times Column 1. The 
number of linear-independent rows or columns is the 
rank of a matrix: 


rank(A) = rank(A’ ) = rk(A) 
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This equation states that the rank computed on the rows 
is equal to the rank obtained on the columns and there- 
fore, by definition, the rank of a matrix cannot be greater 
than the number of rows and columns in this matrix. In 
other words, if a matrix B has 6 rows and 2 columns, we 
can deduce that the rank is smaller than or equal to 2 as 
the rank cannot be greater than the number of columns. 
Consequently, a non-square matrix has by definition at 
least one linear-dependent column or row. For our ma- 
trix B, we know that we have at least 4 linear- dependent 
rows as we have 6 rows and the rank is smaller or equal 
to 2. The Gauss elimination is used determine the rank 
of a matrix by producing as many zeros as possible in the 
hope of removing as many rows or columns as possible, 
which results in the reduction of dimensions. 

YouTube: Rank 


@ Solving a set of linear equation sets 

Naive solving of a linear equation set can be very time- 
consuming. 

YouTube: Naive solving of a linear equation set 

We should start to check whether an equation set is solv- 
able. The set of linear equations is solvable if the rank of 
the coefficient matrix A equals the rank of the augmented 
coefficient matrix (A|b). 


Example 7: Trivial solution only 


2x1 +x2 =0 I 
xj —x% =0 II 
gives: 
I: X, =X 
inl: 2x2 +X2 =0 
Xj =x. =0 


This is a trivial solution. Did we have a chance to 
find this out earlier? 

We have a homogeneous system with as many equa- 
tions m as variables n. 


2 1 
rk] | =2=n 


which indicates that we only have trivial solutions 
according to the Rouché-Capelli theorem. Because 
the determinant is also non-zero: 


2 1 
det i _ i = 
we would also not expect that the system is going 


to lose a dimension. Consequently, we expect only 
trivial solutions. 


~2-1=-340 
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rank(A) = rank(A|b) 


If the matrix is quadratic with m = n and the determinant 
is not zero, then the set of linear equations is solvable. 
In a homogeneous system Ax = 0, it only leads to trivial 
solutions a) = )2...Qmn = 0 Or x] =X2...X, =0. Inanin- 
homogeneous system we get a unique solution. If m <n 
the the rows or columns are linear-dependent. Thus, 
the related homogeneous system Ax = 0 has non-trivial 
solutions, which is good. If m <n, the inhomogeneous 
system has many non-unique solutions. The rules are 
summarized in the Rouché-Campelli theorem. We illus- 
trate this with two examples, 7 & 8. 


Example 8: Non-trivial solution 


2x; + 2x2 =0 I 
—x,—-1x2 =0 Il 
gives: 
II X, = —X2 
inl — 2x7 + 2x2 =0 


choose x2 = A to get the solution: 


xX) \ —A 

xo) \A 
What would we expect? According to the Rouché- 
Capelli theorem we have many non-trivial solu- 


tions because we have a homogeneous system with 
fewer equations m than variables n: 


2 2 
rk] 2, 4 =l<n 


and the determinant also indicates a loss of one 
dimension: 


2 
det E 


’ =-242=0 


A set of linear equations can be solved with the Rule of 
Cramer: 
— det(A;) Ze |A;| 
“i Get(A) [A] 


i=l,...,n 


using the site determinant |A;|. The side determinant |Aj| 
is the determinant |A| where the i, column is replaced 


by column vector b. 
YouTube: Cramer’s Rule — 2x2 and 3x3 matrices 
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Example 9: Cramer’s Rule 2x2 matrix 
Consider the linear system: 


aX] +a42x2 = db) 


421 X1 + 422X2 = bo 


which in matrix format is: 


a1 42} |x} _ fbi 
a2} a2} |x2| | bo] 


The Cramer Rule is then: 


bi ay 

by ax b\ a2 — ay2b2 
XxX] = = 3 

a1 412) = 44122 — 412421 

a2, a2 

ayy P| 

az, bo ayyb2 —b\ ar, 
x2 = = 

a1 412) 41122 — a12421 

a2| a22 


The denominator (fraction below) is not allowed to 
become zero which explains why the determinant 
of A must be non-zero. 


The Cramer Rule visualizes the resolvability but it takes 
too much effort to get the solution. The solving process 
is faster when one subtracts rows stepwise from each 
other to bring the augmented coefficient matrix into a 
certain form. Either in the Gauss form (row-echelon 
form): 


eee el|e 
0 e e e|e 
AOD) ayo. G: @ |e 
0 0 0 e/e 


with as many zeros as possible in the lower left-corner, 
or in the Gauss-Jordan form: 


e 0 0 O/e 

_ {| 0 e e OjJe 
APIS Gr wills 
0 0 0 ele 


with as many zeros as possible in the lower-left and 
upper-right corner. If one eliminates the elements with 
the largest absolute value first, one reduces rounding 
mistakes. This is called pivoting. The row-echelon form 
becomes a reduced row-echelon form if every leading 
coefficient is 1 and is the only non-zero entry in its col- 
umn: 


(A|b)= 


oon Kk 
an 


41 #1 
41 #1 
0 1 
0 1 


ooor 


We have three types of elementary row (column) opera- 
tions: 


e Interchanging two rows (columns) 
e Multiplying a row (column) by a real number 


e Adding a multiple of one row (column) to another 
row (column). 


If we bring the Gauss form back to the equation set form, 
we get: 


ayxXy+ay2x2 + 8s +A1nX_ = DY 
aggX2+ + +X, = bo 
AmnXn = bm 


where we can get our x values by working from the bot- 
tom to the top. If it is in the Gauss-Jordan form, we can 
directly read the solution. 

YouTube: Gauss and Gauss-Jordan elimination 
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Additional reading 


In practice, software usually uses more efficient 
algorithms like the LU decomposition.” The LU 
decomposition dissects a matrix A into two fac- 
tors: a lower triangular matrix L and an upper 
triangular matrix U: 


A=LU 
a43 Ny 
an3| = |bi 
433 131 


a\1 
a21 
431 


43 
U3 


a12 
a22 
a32 


u22 


It resembles the Gauss elimination, whereby the 
upper triangular matrix can be compared with 
the Gauss form and the lower triangular ma- 
trix collects the steps necessary to get the Gauss 
form. The diagonal elements are usually set to 
one 1,1 = lb2 = 153 = 1 for convenience and because 
we have more unknowns than equations. As you 
already saw in the Gauss elimination, it is practi- 
cal to change row and column entries. Therefore, 
algorithms use a permutation matrix P to change 
the rows of matrix A and the permutation ma- 
trix Q to change the columns of matrix A. A is 
multiplied by P from the left and by Q from the 
right. A permutation matrix has n 1s and other- 
wise zero entries. Each row and each column has 
only a single 1. It could look like: 


0 1 0 
P=|1 0 0 
0 0 1 


The LU decomposition is then: 
PAQ=LU 


The solution of the system with linear equations 
can be obtained with setting the LU decomposi- 
tion into the general equation set form: 


Ax=b LUx=b 


Ly = band Ux=y 


Solving first Ly = b and then Ux = y gives us x 
as shown in the following Example 10 and the 
video: 

YouTube: LU decomposition 


* Polish mathematician: Tadeusz Banachiewicz (1882— 
1954). 


Example 10: LU decomposition of 2-by-2 matrix 


We have the system: 


6 1 Xy) 1 

4 2} |x| |2 
We dissect the matrix A into the lower and upper 
triangular matrix: 


6 1) [li Of} fur we 
4 2) [hr be} 0 wae]? 
We obtain the original matrix if we multiply the 


lower with the upper matrix. We obtain 4 equations 
with 6 unknowns: 


l1-u4;+0-0=6 

y-uj2+0-u2 =1 

Input +l2-0=4 

Ip) -Uy2 +h +22 = 2 
This underdetermined system allows us to set 2 
variables to arbitrary non-zero values such as 1 for 
the diagonal elements of the lower matrix: J); = 
lng = 1. This is up to you to decide. We would 
get infinitely many ways as there are also many 
ways to perform a Gauss elimination. We need only 


1 convenient LU decomposition. The following 
matrix entries are then: 


2 4 
=6 =1, h=s =o: 
ui) 7 W2=N, I= 3, W= 3 


Substituting these values into the LU decomposi- 
tion above yields: 


6 | E i | 
= = LU. 
|: 2 2 ilo ¢ 
Solving Ly = b gives: 
1 0 YI _ 
2 1} |y2| [2 
and thus y; = | and y2 = : Solving Ux=y 
6 1 X{ = 1 
0 ; x2) ; 


gives us the solution x2 = 1 and x; = 0. Let’s check 
whether the solution is correct: 


blll G 
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Yes, it works. 

Extra: 

In the Gauss elimination, we would multiply the 
first row by E (from L) and subtract this from the 
second row: 


6 1 k | 
a 

2 2 4 

42.6 9-241 0 4 


Determinant: 

The determinant is easier to solve with the LU de- 
composition because it depends only on the diago- 
nalelements. The determinant is det(A) = det(L) det(U). 
The determinant of the matrix det(A) = (1-1)-(di1l22)- 
(ujju22) =1-1- (3 -6) = 8 is only dependent on the 
diagonal entries. 


The storage demand for the lower and upper coef- 
ficient matrix can be huge. Thus one can make use of 
many zero values and use iterative procedures with esti- 
mators. An example is the Gauss-Seidel method,” which 
needs much less storage but cannot be used for paral- 
lel computing. For matrices larger than n = 1000, the 
Strassen algorithm? might be faster with lower numeric 
stability. Only the Coppersmith-Winograd algorithm* 
is even faster but only for matrices too big for modern 
computers. 


@ Eigenvectors and Eigenvalues 

Modeling complicated systems can result in very ugly 
mathematical problems. Instead of solving these kinds 
of problems, mathematicians transform the space until 
the problem becomes easier to solve. The solution of 
the nice, beautiful system can then be transformed back 
to the original space. Imagine, you are sitting in your 
room, and you damn the authors of this script, the ex- 
pectations, and math in general. Now, you squeeze and 
stretch the dimensions of your room until this sheet of 
paper becomes infinitely small and your head so big that 
you master its challenges without any problem. After 
you are done, you multiply your entire existence with 
the inverse of the transformation matrix and find a fully 
understood and solved script in front of you. How com- 
plicated a problem is also depends on psychology. Forget 
everything around you; allow yourself to make mistakes 
and to be a child on discovery. The exam is not impor- 
tant, but the excitement to learn and the willingness to 
improve is. This is my favorite psychologic transforma- 
tion matrix. Give it a try. So why is this imagination 
important for the Eigenvectors? Eigenvectors are the 
directions in space, which are not deformed during 
transformation. Imagine, everything gets squeezed and 


? German mathematician: Philipp Ludwig von Seidel (1821—1896). 

3 German mathematician Volker Strassen (1936—today). 

4 Israeli American computer scientist Shmuel Winograd (1936— 
today) and American mathematician Don Coppersmith (1950— 
today). 


changes—just not in the directions of these Eigenvectors. 
The directions along the Eigenvectors become merely 
scaled by the Eigenvalue. Would it not be wonderful to 
see a problem related to these vectors, compared to an 
original unfavorable coordinate system? But how can we 
find out which directions are not influenced by a trans- 
formation? Let’s say we have a transformation matrix A 
and look for a vector v that can only be scaled by a factor 
but not deformed by a matrix. Then we could just say 
that the matrix is only like a factor. Let us call it A and 
imagine the Eigenspace: 


Av=Av 


The idea is not bad, but the right side does not have the 
same structure as the left one. The identity matrix helps: 


(A—Al)v=0 


The matrix (A — AJ) should now be singular, to get non- 
zero solutions for the Eigenvector v. The Eigenvector 
v = 0 would be trivial and useless. A matrix is singular 
if its determinant is zero: 


det(A — AI) =0, 


which also means that the matrix is not invertible. We are 
interested in vectors, which are scaled but not deformed 
by a transformation matrix A multiplied by the left. We 
calculate first the Eigenvalues and than the Eigenvectors. 


Example 11: First Eigenvalues than Eigenvectors 


We take the matrix: 


( §) 


and get the Eigenvalues by: 


3-A 6 
Ana=[P | 
det(A — AI) =0 = (3—A)(4—A)-6 
0=A?—-7A +6 


0=A*+pA+q_ (p-q-equation) 
P 


q- 
De CE 
Mz=—5+ 5) 4 
1 eee 6 hee 
Ma=5+ (Z) —6 
745 
pee 
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The term det(A — AJ) creates a polynomial which 
we call a characteristic polynomial with maximal n 
solutions for a nxn matrix. Setting it to zero causes 
it be renamed a characteristic equation. Now we 
have the Eigenvalues A; = 6 and A2 = 1 and search 
the related Eigenvectors starting with A, = 6: 


(8 J-f s)ene 
Be | v=0 


to obtain the Eigenvector v, = | directly. We can 


also write it in more detail: 


—3vqg + 6vy, = 0 
Vq — 2v,p = 0 


resulting in vg = 2v, for the first equation. The 
second equation gives 2v, — 2v, = 0 which means 
that v, can be everything but the equation set is 
still solved. We have infinitely many solutions. So 
we can set v, = 2 with which we know that vp, must 
be one v, = 1. We obtain the non-unique solution 


v= Al . The reason is that we can only stretch the 


1 
Eigenvectors. 
We proceed with the second Eigenvalue A2 = 1: 


(8 J- ene 
i s|”=° 


to obtain Eigenvector vz = 


ae | 
ara 
ww 
es 


Summary: 


1 Use the determinant of (A — AJ) to get a polynomial 
of degree n. 


2 Find the Eigenvalues by identifying the roots of 
the characteristic equation with det(A — AJ) = 0. 


3 Find to each Eigenvalue the associated Eigenvector 
via (A—AD)v =0. 


If all Eigenvalues have another value, we call them sim- 
ple and the associated Eigenvectors are independent. If 
an Eigenvalue appears several times, we say that the 
Eigenvalue has multiplicity k. Software programs like 
MATLAB return normalized Eigenvectors with unit length. 
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YouTube: Eigenvalues and Eigenvectors 


Fast equation for 2-by-2 matrix 
For the dimension 2 to 4, we have fast equations to solve 
the Eigenvalues. From the matrix: 


a b 
c d 
we obtain the characteristic polynomial: 


(ad — bc) 


ae i “a =? —(a4+d)a4 
=A? — Atr(A) + det(A) 
from which we get the Eigenvalues with the equation: 


ae tr(A) + \/tr?(A) — 4 det(A) 


2 


with the distance between Eigenvalues: 


A=: y/tr2(A) —4det(A) 


which will play a role in the stability theory in Block 3. 
Additional information: for a matrix of higher dimen- 
sion, it already seems to be more complicated. 


det (aI —A) =0 
1 
= 0° — a tr(A) — a5 (tr(A”) —tr?(A)) —det(A) 
and for Dimension 4 it is not considered overly useful. 


Example 12: Get Eigenvalue via fast equations 


gives tr(A) =4—3 = 1, det(A) =4(—3) —3(—2) =—6 
and the characteristic equation: 


0=A*-1A-6=(A—3)(A42) 


with Eigenvalues 3 and -2. The Eigenvectors are 
worked out using: 


1 3 6 3 
a-a=[, A), asa=[§ 3] 


(3, —1) for Eigenvalue 3 and (1,—2) for Eigenvalue 
-2. 
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Figure 10. PCA plot of cell lines. Location and sphere size 
indicate which cell lines behave more similar than others. 


Cell lines which are close behave similarly. Source: [5]. 
Copyright © 2013, Macmillan Publishers Limited. 


@ More on biological data as a matrix 

Often, the network topology is unknown (but of course 
existent). Detected molecules change with time, and 
their behavior depends on their connection to other ele- 
ments. Among them, interlinked elements correlate or 
anti-correlate with each other. The more different pertur- 
bations are applied, the more likely a valid connection 
can be detected. This is a property which can be used 
for, e.g., reconstruction of networks and for data-driven 
modeling. The simultaneous observation of more than 
one outcome variable is the definition of multivariate 
statistics. Related data-tables with multiple variables 
are actually nothing other than matrices with molecular 
features as rows and experimental datasets, conditions, 
or organisms as columns: 


Cell line 2 


Microarray 1 
Microarray 2 
Cell line 1 


Protein 1 


Gene 1 Protein 2 


Gene 2 


The columns represent the dimensions or coordinate 
axes of the data and the rows contain coordinate val- 
ues on the given coordinate system. Consequently, each 
gene is a vector in a coordinate system spanned by the 
matrix columns. 

To illustrate the application of matrix calculation, we 
will briefly discuss here the statistical method of Prin- 
cipal Component Analysis (PCA): sometimes, we want 
to know whether the columns, with all the row entries 
projecting to them, have subsets that are more similar to 
one another than to other data points. Now, imagine a 
dataset with 40 coordinate axes representing 40 microar- 
rays, and then try to place tens of thousands of points 


representing genes into this coordinate system. Horri- 
ble! But maybe we can find another coordinate system 
which is visually more inviting and more informative. 
Let’s say 40 data points representing microarrays in a 
3-dimensional coordinate system, where the biggest dif- 
ferences between the microarrays are selected and are 
thus pronounced. The variability in each row causes the 
differences between the microarrays. Data analysts use 
Principal Component Analysis (PCA) to do exactly this. 
One starts with 1 additional coordinate axis which ex- 
plains the most variability in the old dataset. This is the 
first principal component. The algorithm then searches 
for another coordinate axis which is orthogonal (90°) 
to the first principal component and again describes as 
much variability as possible. This procedure goes on 
until all genes project to the new coordinate system. Af- 
ter one has all these principle components, one neglects 
stepwise the components representing the lowest vari- 
ability until 2 to 4 remain. Ideally, the remaining princi- 
pal components explain more than 80% of the variability. 
Principal Component Analysis is used, e.g., as standard 
quality control for microarrays. Are the treated and the 
untreated samples in two separate groups? Is one repli- 
cate completely different from the others and might it 
represent an outlier? As an example, PCA also helps us 
to understand the microbiome better. Are different bac- 
teria types more closely related to others and how does 
it change if they are exposed to drugs or another diet? 
A collection of micro-organisms might build one cluster. 
After the diet is changed, the micro-organisms might be 
found in another cluster. Micro-organisms which have 
not changed might not be affected by the diet change. In 
Figure 10, you see an example with different cell lines 
and their response to TNF-related apoptosis-inducing 
ligand (TRAIL) [5]. The dataset contains the base level 
of 17 core apoptosis proteins in 11 melanoma cell lines 
under different conditions with 612 measurements in 
total. The authors used network information to group 
proteins to functional network motifs, which allowed 
a higher accuracy in estimating the apoptosis-inducing 
impact of drugs. 

YouTube: PCA step for step 
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Additional reading 


The PCA is related to the partial least square 
regression (PLSR). Here, we separate our data 
matrix in two parts, whereby the upper part rep- 
resents phenomenological readouts such as via- 
bility: 


condition 1 
condition 2 


Viability 
Cell death 
Protein 1 
Protein 2 


Both blocks are subject to a dimension reduction 
approach but now with the aim to maximize 
the co-variance between both blocks. PCA uses 
the variance and PLSR uses co-variance. Such 
an approach is, e.g., used for systematic drug 
testing and for initial hints for network modeling. 
An example is shown in Figure 11. The authors 
created an high-dimensional data block with 5 
different RAF/MEK inhibitors, 7 doses, 5 time 
points, 21 protein levels, and 10 other cell lines 
[6]. Here, drug dose is the first and drug type is 
the second principal component. The adjusted 
variable importance in the projection (VIP) 
explains which protein had the most prominent 
negative or positive impact on the cell viability 
at which time. This led to the identification of 
a consistent down-regulation of the JNK/c-Jun 
pathway upon RAF/MEK inhibitor treatment at 
early time points, but an up-regulation of 6 cell 
lines at later time points. In 4 out of 10 cell lines, 
JNK/c-Jun up-regulation caused a subset of cells 
to become quiescent and apoptosis-resistant [6]. 


We have learned that matrices are used in several 
applications in biology. Matrices can represent biochem- 
ical networks and data caused by them. In the following 
section, we will learn how to work with matrices. These 
basics are fundamental for the remainder of the book and 
very useful for your career, independent of whether you 
want to mainly work with the keyboard or the pipette 
in the future. It will help us to analyze and understand 
what we do. 
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Time (hr) 
Correlation with viability mio omm5 10 mm 24 mm 48 


<—_,* 
LOXIMVI MMACSF MZ7MEL RVH421 SKMEL28 WM115 WM1552C 


p27 Kip1 NIA 
p-Histone H3°) 
cPARI 


Total cJun 
p-JNK‘TI88788) 
cr172) 


p-AKT(1208) 
p-p90RSK™=) 
p-p90RSK'S=") 


N/A 


42024 42024 42024 42024 42024 42024 42024 
VIP score VIP score VIP score VIP score VIP score VIP score VIP score 


Figure 11. After the perturbation of the system with differ- 
ent conditions, we see which protein on the left correlates 
positively or negatively with the cell variability in differ- 
ent cell lines such as LOXIMVI or SKMEL28. The cell 
lines behave quite differently. The importance of each 
player depends on the time, so phosphorylated Histone 
H3 is first correlated with the cell viability after 24h or 
48h. Figure source (cropped): [6], Licence: CC BY 4.0. 
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3. Exercises 


@ Representation 
Write the following systems of equation in matrix form: 


yy = 2x, +322 

y2 = x2 — 5x1 (3.1) 
xX=1-x-y 

a a (3.2) 
Yr = 81141 + by2.x2 + 6)3.x3 

¥2 = Opax2 + box) + 6p3x3 (3.3) 
¥3 = 6333 + 632x2 + 631%] 


Write the following matrix equation as a system of dif- 
ferential equations: 


d (xi\_ (3 —2\ (x 1 
w(a)=G s)G)+)e 
@ Basic operations on matrices 
Perform the following calculations: 
3-2 4 —4 
& 5 )+(; a) (62) 
3-2 4 —4 
(; 5)-( °)- ee) 
3-2 
5 (; 5 ) = (3.7) 
3 -2\ (4 —-4 
( 3)G o)- 9) 
4 —4\ /3 -2 
G o)G 3)- 9) 
Transpose the following matrices: 
T 
3 -2 
(3 2’ (310 
3-2 1\* 
1 5 O} = (3.11) 
2 -1 7 
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§ Determinant 
Find the determinants of the following matrices: 


3 -—2 
(62 is 
3 —2 | 
1 5 O (3.13) 
2 -1 7 
3 —2 2 
1 5 O (3.14) 
2 -1 0 
3 —2 2 2 
1 5 1 2 
> -1 -1 2 (3.15) 
1 2 3 1 
® Rank and inversion 
Determine the rank of: 
3 —2 
(3, 2) aa 
3 —2 
( : z (3.17) 
3 —2 1 
1 5 O (3.18) 
1 2 3 
1 1 -3 
1 -1 2 (3.19) 
2 0 -1 
Invert the following matrix: 
3 —2 
( a? ) (3.20) 
21 -1 
02 I (3.21) 
5 2 -3 
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@ Linear systems of equations 

Solve the following systems of equations (Ax = b). Con- 
sider thereby the rank of the coefficient matrix A and of 
the augmented coefficient matrix (A|b). 


2x1 +x2 —x3 =0 


3.22 
x2 +x3 =0 ( ) 
2x1 rx. = 0 (3 23) 
xy —-x%2 =0 
xy +2x2 = 1 (3.24) 
xy + 2x9 =2 : 
5x1 +x2 =2 
3.25 
X1 — 2x2 =7 ( ) 
2x1 +x2—x3 = —5 (3.26) 


x2 +143 =1 
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Notes 
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4. Solutions 


Do not betray yourself! 
Exercises 
@ Representation 
Task 3.1 
yy = 2x, +3x2 2x1 + 3x2 YI 
y2 = X2—- 5x1 —5x +r X2 y2 
2 3\ (x\ _ 
—5 1) \x) 
Task 3.2 


X=1-x-y =| =] 
6 
y =] +x-—y 1 —1 
Task 3.3 
Yt = 81141 + Sy2.x2 + 6) 3.x3 
2 = Op2.x7 + by x1 + 63.3 
¥3 = 6333 + 632x2 + 631%} 


yi Oi 812 -613\ (x1 
yo} =| dn do2 63 | | 20 
¥3 631 632 633) \x3 
Task 3.4 
d xX) \ 3. -—2 XxX] + 1 
dt\xm/) \1l 5 x2 0 
ad = 3x, —2xo+u 
d. 
| =x, +5x2 


@ Basic operations on matrices 
Task 3.5 


Gs)+G o)=G 5) 


Task 3.6 


3 -2\/4 —-4 - 
aes 
(1-4) + (5-3) 
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Task 3.9 


Go) 5) 


12-4 -8—20\ (8 —28 
9+0 -6+0/ \9 —-6 
Task 3.10 
5 = Pe 
1 5 ~A\-2 § 
Task 3.11 
= 1y* s if 2 
1 5 O ={—2 5 -l 
2 -1 7 1 0O 7 
@ Determinant 
Task 3.12 
3-2 3-2 
act (} =i 3|-6 5) —(1-(-2)) =17 
Task 3.13 
3 2 1 
det} 1 5 O] =(3-5-7)4+(-2-0-2)4+(1-1-(-1)) 
2 -1 7 
—(2-5-1)—(-1-0-3) —(7-1-(-2)) 
= 108 
Task 3.14 
3 2 2 
det} 1 5 OJ] =0+0—2—20—0—0=-—22 (41) 
2 -1 0 
Task 3.15 
3 —2 2 2 
1 5 1 2 
2 -1 -1 -2 
1 2 3 1 
5 1 2 1 1 2 
=3)-1 1 2} — (—2) |2 1 2 
2 3 1 1 3 1 
1 5 2 1 5 1 
+2|2 1 2) —2|2 1 1 
1 2 1 1 2 3 
= 3(-5-4—-64+4+30+1) 
2(-1-—24+124+2+6-2) 
2(-1—10+84+2+44-10) 
—2(-3-5+4+1+2-30) 
) = 138 


B® Rank and inversion 


Task 3.16 
3. —2\04+2-1/3 —2 ; 
is rs _ i 9 ) Ranks 1 
Task 3.17 


3 -—2\T1-2-1/3 -2 . 
€ 2 _ ¢ 1) Ranks 2 


Task 3.18 


3 —2 1\I-3-H/0 —-17 1 
1 5 O = 1 5 0 
1 2 3/ Il-ll \O —3 3 
Task 3.19 
1 -3 Il-I 1 1 -3 


1 -l 2 = |0 —2 5 


— Rank is 3 


2 0 -1/M-2-1\0 —2 5 
it — 
i 0 -—2 5 | >Rankis2 
~ \o 0 0 
Task 3.20 
3 2 
E 3 |=9412=21 
—l 
a 22 Aes 2 
6 3 ~ 21\-6 3 
Task 3.21 
a es | 
0 2 1/=-124+54+10-4=-1 
5.2 8 
> t i 
i a | = 
5 2 23 
+[2-(—3)—(1e2)|  -—[O(—3)—(1a3)] 022 
eae) a2), Se 
hit {— 123) —[2-1-(-1-0)] 4[2-2 
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Task 3.22 


2x1 +x. —x3=0 
x2 +x3=0 


We start with the first equation: 


I-III: 2x; —2x3=0 
Ixy =A]: 24—2x3=0 


and use the parametric A. Then we solve the second 
equation: 
x3=AinIl: x.+A=0 


x2 =A}. 


and get the parametric solution or general solution (x,,x2,x3) = 


(A,—A,A). The matrices are: 


2 1 -!il : 
A=(5 1 ||) -* Rankis2 
2 1 -1/0 ; 
(alb)=(5 1 1 [6 ) Ranks 2. 


We have 3 variables but only 2 equations. One of the 
variables has to be chosen (or treated as a parameter) 
and the solution for the 2 other variables will depend on 
this choice or the parameter. Because we have infinite 
different choices for this parameter (often called lambda 
1), we have infinite solutions. 


Task 3.23 
2x, +x. =0 
Xj —xX2 =0 
I+I: 3x; =0 
x1 =0 
xy3=Oinl: O-x.=0 
x =0 


The matrices are: 
A= (; i) — Rank is 2 


(alo) =( 5 1 |p) Rankis2. 


We have 2 equations and 2 variables with exactly 1 solu- 
tion. 


Task 3.24 
x, +2x2 = 1 
xy +2x =2 
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I-H: o0=-1 ¢ 


The equation set is inconsistent. The matrices are: 


1 2 : 
A= é 5 — Rank is 1 


(alb)=(j |) Ranks 2 


Task 3.25 
5xyj +x2 =2 
xy —2x%) =7 


Alternative 1: 


2-I4+T: 11x; =11 
xy=1 
x)=linIl: 1—2x,=7 
x2 = —3 


The matrices are: 


5 1 : 
A= (? 2 — Rank is 2 


(alb)=(5 1, |7 ) > Rankis2. 


We have 2 equations and 2 variables with exactly 1 solu- 
tion. 


Alternative 2: 
We use Cramer’s Rule now with the determinants: 


det(A) = ; - =-10-1=~-11 
2. I 
det(A;) = & _9| = —-4-7=-11 
det(A2) F : 35 —2 = 33 
which finally give: 
_ det(A1) =i. 
1 Get(A) 11 
d 
a et(A2) _ 33 a 
det(A) —11 
Task 3.26 
2x1 +X2—x3 = —5 
x2 +2x3= 1 


Alternative 1: 
We start with the first equation: 


I+Il: 2x,;+2x2 = —4 
Ix =Al: 204+2m=-4 
x2 = —2-A 
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and then we solve the second equation: 


m=-2-AinI: —2—-A+x,=1 


fs=374] 


Alternative 2: 
2 equations with 3 unknowns. 1 is flexible. Choose 


f=") 


I-Il: 2x, — 2x3 = —6 
Il: x2 +x3 = 
x2 =1-A 


The matrices are: 


2 1 -!i1 . 
a=(‘ 1 ||) ++ Rankis2 
2 1 -1)]5 . 
(4b) =( 4 \ 1 |} ) > Rankis2. 


More variables than equations. Thus, we have infi- 
nite solutions. 


The number of solutions of linear equation sets can also 
be determined with the Rouché—Capelli theorem? sum- 
marized in Table 1. 


5 French mathematician: Eugéne Rouché (1832—1910). 
Italian mathematician: Alfredo Capelli (1855—1910). 
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Table 1. Overview: Solutions of linear equation systems 
(Rouché—Capelli theorem) 


homogeneity 
Ax=b Ax=0 
m equations (homogeneous system) 
n variables 


Rk(A|b) # Rk(A) — system unsolvable not possible for b= 0 


Task 3.24 homogeneous systems 
always solvable 
Rk(A|b) = Rk(A) system solvable 
unique solution trivial solution 
m=n (x=0) 
Task 3.25 Task 3.23 


non-unique solution non-trivial solutions 


Task 3.26 Task 3.22 


Notes 


Chapter 2: Metabolic modeling 


Thomas Sauter, Marco Albrecht 


Motivation 

Metabolism is a mirror of many biological processes and extracellular metabolites can be accessed with relatively 
robust measurements. Moreover, basic analysis with a stoichiometric matrix (representing a metabolic network) 
can be adopted by many researchers because it is mainly based on linear algebra. However, well-trained experts are 
needed for the analysis of dynamic behavior, contextualization, and advanced research questions. Metabolism can 
also be analyzed on a large scale, e.g. with graph-theoretical approaches or in more detail with deep, mechanistic, 
and dynamical insights. In this chapter, we introduce Stoichiometric Network Analysis (SNA) which is an 
approach that is somewhere between detailed and large scale. SNA helps to exclude biologically unrealistic 
scenarios and, instead, gives us a variety of possible flux distributions. To thereof identify the biologically 


meaningful solutions, we need to constrain the solution space further, e.g. with measurements. We have two 
possible methods at hand. Either we impose known fluxes and maximize, e.g. the growth rate in a constrained 
model with Flux Balance Analysis (FBA), or we impose measured fluxes and minimize residuals during numeric 
matrix manipulations with Metabolic Flux Analysis (MFA). FBA also helps us to solve genome-wide problems, 
while MFA scrutinizes smaller models and allows the integration of isotope-labelled metabolites. Either way, the 
insight we obtain is crucial to understand the biology of metabolism. 


Keywords 
Metabolic modeling — Stoichiometric matrix — Flux Balance Analysis — Constraint-Based Modeling 


Contact: thomas.sauter@uni.lu. Licence: CC BY-NC 


Contents introduced in Chapter 3). Metabolic models can be built 


with different levels of detail and in various ways. We 


1 Lecture summary 4 start with the following simple classification, in which 


1.1 M Stoichiometric Network Analysis... . 2 the capital letters in the reactions are the concentrations 
1.2 Mi Integration of experimental data... . 11 of substances and the reaction arrows or edges define 
1.3 Mi Constraint-Based Modeling ...... . 44 the reactions, which can symbolize enzymes, fluxes, and 
1A. WM Probleme & disclission. «oe seu 1g other transport systems. Principles of mechanism-based 
1.5 Ml Metabolic Flux Analysis... 2.2... 19 models and dynamic analysis will be addressed in Chap- 
ters 3 and 4 of this book. 
References 22 
ease 93 Table 1. Bottom-up approaches: 
; interaction-based graph theory 

3 Solutions methods 

Do not betray yourself! 26 2 
1. Lecture summary (topology) 


We learned about the essentials of graph theory and de- 
rived the first stoichiometric matrix (Chapter 1). Nodes 
and the interaction between them represent the topol- 
ogy of networks. Such networks can be written in matrix 
form. The introduction to linear algebra was of great 
importance to entering the field of metabolic modeling. 
Metabolic networks mainly describe enzyme-controlled 
fluxes of metabolites within a system. We can now study 
different methods for the modeling of metabolic net- 
works except for dynamical mechanistic models (to be 
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stoichiometry-based stoichiometric network 
analysis 
A+B==C 


mechanism-based 
A+B==C+D 
1 


dynamic analysis 


https: //doi.org/10.11647/OBP.0291.02 
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1.1 Mf Stoichiometric Network Analysis 
The stoichiometric matrix 


For balancing, we need: 

m... number of substances 
r... number of reactions 

n... stoichiometric coefficients 
vj... rate of reaction j 


njj... stoichiometric coefficient (conversion) of metabo- 
lite i in reaction j 


S;... concentration of metabolite i (also: c;) 


These are elements of the following general equation 
that describes the change of substrate concentration S; 
over time: 

dSi * 

Ts = Si = Leni 


i=1,...,m 


= NyV_ +NiQV2 +... + NirVy 


The change of substrate concentrations is a consequence 
related to both the general stoichiometry of the reac- 
tion network—as indicated in the stoichiometric matrix 
N—and the given flux vector v. The respective matrix 
representation is: 


$=Nv S = (S1,S2,...,Sn) 
= (V1,V2,-.+5Vr) 


and the stoichiometric matrix is: 

N= i 

= {nj} J o=l..r 

The columns of the stoichiometric matrix represent the 
reactions and the rows represent the metabolite concen- 
trations. 
For example, a reconstruction of a "genome-scale net- 
work" for E. coli encompasses 1136 unique metabolites 
and 2251 reactions. This results in a stoichiometric ma- 
trix N with 1136 rows and 2251 columns resulting in 
2557136 entries. However, the matrix is usually sparse 
and only >25000 entries are actually non-zero [1]. 
The stoichiometric matrix: 


e does not change over time (invariant) 
e usually has many zero entries 


e contains pseudo-reactions (uptake, excretion, trans- 
port, growth) 


e is crucial for structural analysis with linear algebra. 
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Concentrations are usually written with brackets [], but 
in the context of this chapter we always assume con- 
centrations when using a given molecule name in an 
equation, and thus write concentrations without brack- 
ets if not stated differently. E.g. we write G6P for the 
concentration of glucose-6-phosphate instead of [G6P]. 
Also note that some reactions can be reversible. In this 
case, we can only consider the net reaction rate vr — v, 
(from left to right) to create the stoichiometric matrix. 
Thus, some fluxes can be negative, if the backward re- 
action v, is larger than the forward reaction vr. Alterna- 
tively, we can split a reversible reaction into two elemen- 
tary reactions (see Example 1). 


Example 1: Glucose to fructose 
PGI reaction: GgP == FeP 


Two metabolites m = 2 
Case 1: two elementary reactions (r=2): 


PGI reaction: Ge6P = FeP v=const. 


ny = —1; nq) = 1; 142 = 1; 22 = —-1 
—l 1 
w=(' 4) 


> S;= G6P =nNyVp +nNy2V2 = —Vp + V2 


So = FeP = novi +n22v2 = +V1 — v2 


Case 2: one net reaction (r=1): 


PGI: GeP —=FeP  v=const. 
ny =—l1,n1=1 
=i 
w= (7) 

S$; = GeP = nv = —-v 


So =FeP = nj;v=-+v 


Example 2: Oxohydrogen reaction 


Reaction: 2H» + O7 == 2H)O 
m=3; r=13; ny = —2; m1 = —1;5 n33 = +2 
Sy = H = (—2v¢ + 2vp) =—2v 
Be OR: SI Ve My) 
$3  =H,0 = (+2v¢ —2vp) +2v 


The stoichiometric matrix is: 


reactions 
-—sN 


N=|-—1] Oo } metabolites 
2 } HO 


with the final representation in matrix and equa- 
tion set form: 


Hp —2 H> = —2v, 
Oo =|-l (v1) => Oo =-V{ 
H20 2 HO =+42y, 

ee sonasessces 


Ti ai le vine Pe i mmm 


Figure 1. A simple metabolic system. The grey dotted 
line represents the system boundary. The A Molecule 
is transported from outside to inside and is then con- 
verted to Molecule B via two possible reactions. Finally, 


Molecule B is leaving the system. 


Example 3: Simple metabolic network 


As shown in Figure 1, there are two possible ways 
to produce Metabolite B, either via v2 or v3. All net- 
work edges can be weighted so that we do not have 
an incidence matrix, but a stoichiometric matrix: 


Vyov2 v3. V4 
1 -1 -1 O |Ain 
B 


N= | 0 1 1 —-1| Bin 
-l1 0 OO |Aou 
0 oO O 1 | Bou 


where the reactions determine the columns and 
the metabolite concentration determines the row 
entries. External metabolites are not balanced in 
this approach because the steady state of external 
metabolites (in our example Aj and Bow ) cannot 
be assumed. Consequently, we can reduce the sys- 
tem to: 
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Example 4: Enzyme kinetics 


al v2 
Balancing: E+5 pe ES > E+P 
Resulting in: 


S -1 1 0 
E —t a 4 " 
ES’ | t =t =1) |" 
P 0 0 1 ibe 


Figure 2. Another metabolic network/system with two 
inputs and three outputs. The transport of Molecule B is 
bi-directional. The system is controlled by the transport 
of the external concentration of A and B and produces 
Product P but can also produce B and E. 


Example 5: Larger metabolic network 


A slightly larger metabolic network is shown in 
Figure 2 with the reversible reactions: 


rev = {R2,R8} 
and irreversible reactions: 
irrev = {R1,R3,R4,R5,R6,R7,R9,R10} 


Note that the forward reactions (v2 : Boy — Band 
vg : B + C) were considered in order to create the 
stoichiometric matrix N: 


10 0 Oo -1l -1 -1 0 0 O]JA 
0 1 0 0 1 0 oO -1 -1 O|B 
N=/)0 0 O 0 0 1 0 1 0 —-1;C 
0 0 0 0 oO O 1 0 O -I1;D 
00 0 -!1 0 0 0 0 90 1/E£ 
00-1 0 0 0 0 0 1 1 |P 


Example 6: Branched metabolic network 


al v2 V4 
= $; => 2S = 
AWV3 
S3 


What does the stoichiometric matrix N and the bal- 
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Figure 3. Flux distributions of Example 7. The balance 
of input flux v; and output flux v3 requires a detour via 
Metabolite B with flux v2. 


ance equation look like? 
The stoichiometric matrix is: 


reactions 


es a a 
N=(0 2 0 -2])S)$& 
0 0 1 O78; < 


The balance equation of the system in the matrix 
form is: 


ce af oA 
Sls(@ 2 oo sap] 
Ss 00 1 o/{® 
(3x1) 4x4) vA 


and in the equation set form is: 


S =v1—-v2—-V¥3 
So = 2v9 —2v4 


$3 = V3. 


System in steady state 
Usually we would like to look closer at our system and 
see what happens if: 


e fluxes are constant and do not change over time 


e metabolite concentrations are constant and do not 
change over time (S$ =0). 


Therefore, we look at the steady-state solution: 
Nv=0 


The steady-state solution is available and appropriate as 
soon as the states of the system no longer change and 
thus time dynamics are not taking place. 

See Example 7 with Figure 3. Here, we have the sit- 
uation as described in the motivation: we have many 
non-trivial solutions but we do not yet know which so- 
lution is biologically meaningful. Therefore, we need 
to constrain the system further by integrating experi- 
mental data. In the practical work, one rate might be 
measured, e.g. v} = 1 and then the other fluxes (v2, v3) 
can be calculated accordingly. 


Example 7: Flux in steady state 


The system in Figure 3 can be written as a system 
of elementary reactions: 


0==A 
A= |B 

v3 
A+B=—=0 


and the related equation set: 


A=v1—v2—v3 =0 (1) 


B=v.-v3=0 (2) 


which can be written in the matrix form: 


fj. a7 
F 1 | v2| =0 rk(N) =2<3=r 
"| —S 


Because the rank of N is smaller than the number of 
reactions, we have to deal with an infinite number 
of non-trivial solutions. 
Solution: Assume: v; = a. 
Beginning with Equation 1: 

a—v2—v3=0 


v2 =a—V3 
which is then put in Equation 2: 


a—v3—v3=0 


a=2v3 
a 
WRB 5 
2 


which in turn gives the solution if it is put into the 
previous equation => v2 =a—§ = §. The solution 


is: 


val 1 
v2| =a}0.5 ;aER 
V3 0.5 


Elementary flux modes (EFM) 

If we cannot determine a unique solution of the flux 
network due to too many unknown rates (see also rank of 
the stoichiometric matrix), then we can study elementary 
flux modes and the kernel matrix, both of which allow 
us to describe all possible solutions. Some applications 
thereof are: 


e testing a set of enzymes for the production of a 
desired product 


e detecting non-redundant pathways 
e analyzing effects of enzyme deficiency 
e identifying drug targets. 


Flux modes describe possible pathways from one metabo- 
lite to another if the system is in steady state. Elementary 
flux modes (EFM) are a set of non-unique and linear- 
independent basis vectors vz, which can be summarized 
in the kernel matrix K. The kernel matrix also fulfils the 
steady-state relationship: 


NK=0 


Example 3 delivers for the molecules A and B the follow- 
ing stoichiometric matrix: 


12 
it =<t =f t 4 
Alo. 4. 4 eal ee ei 
1 2 


r—rk(N)=4—2=2 


The kernel matrix K has as many columns as the stoichio- 
metric matrix N has linear-dependent columns (number 
of columns/reactions r minus the rank of N). The de- 
termination of the kernel matrix is shown in Case Box 1. 
The EFMs in the kernel matrix are non-unique and can 
thus consist of all vectors which: 


e fulfil the steady-state condition 
e are linear-independent of each other. 


Non-unique means that the kernel matrix can have many 
different solutions which fulfil the steady-state relation- 
ship NK = 0. Often, we are just interested in getting one 
of these possible solutions. Now, you could say, you find 
more flux modes than the indicated basis vectors (EFM) 
in the kernel matrix K. Well, the flux modes v can be 
reconstructed by the non-unique basis vectors (EFM) in 
the kernel matrix v;: 


I Il l-I -I 50 [-—2I 
1 2 1 —1 10 —3 
ve: |L] &]1 v: JO};/—-1l];] 5 );}]-1]; 
0) 1 1 0 5 —2 
1 2 1 —1 10 —3 
independent dependent 
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In our example, another possible kernel matrix would 
be: 


1 
0 
1 
1 


RP ORF 


The kernel matrix can provide information about blocked 
reactions. Blocked reactions are pathways without a 
metabolic flux. Rows in the kernel matrix which have 
only zero entries, é.g.: 


K=;0 0: 0] blocked. 


indicate blocked reactions and two examples can be 
found in Figure 4a and 4b. 


{ 


—+A—+ BCD 


—B 
ue D+ 


(a) Blocked reactions by dead-end or lack of input. 


— 


oe —A —- 
AB Cy a 

= Se 
™ 7 
ee Ce 


(b) Coupled reactions reduced (left) and null-space analysis 
via the kernel matrix (right). 


The kernel matrix also refers to coupled reactions. Cou- 
pled reactions are reactions which always appear to- 
gether in the same ratio. Their rows only differ—if at 
all—in a scalar factor: 


1 

0 -l 
1 1 

lt 8 

Blocked reactions could be removed and coupled reac- 
tions could be simplified, which would reduce the num- 
ber of columns r in the stoichiometric matrix so that we 
reach another network representation with the property 
r—rkN = 0. Remember that, depending on the rank of 
the stoichiometric matrix, we have two possibilities for a 
solvable homogeneous system according to Table 2. 
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Table 2. Overview: Resolvability of linear equation sys- whereby the value | is the most frequently used 
tems for metabolic models (Rouché—Capelli theorem). number to simplify follow up calculations. The sim- 
homogeneity plification and cleanliness of calculation reduces 

N has Nv=b Nv=0 the risk of errors and is thus a smarter choice. We 


m metabolites (inhomogeneous system) (homogeneous system) 


; get, for the remaining unknowns, the values: 
r reactions 


rtk(N|b) A rk(N) system unsolvable not possible for b = 0 = ki3=0 
homogeneous systems 
always solvable => kig4=1 
rk(N|b) = rk(N) system solvable 
unique solution trivial solution and the basis vector: 
m=r (v=0) 
ee cee ae eee ee he ee et et 1 
non-unique solution non-trivial solutions 1 
m<r => ki = 0 
1 
We either have the trivial solution or the non-trivial so- 2. Basis vector: 
lution: = 


Trivial solution: thermodynamic equilibrium with all Nk, =0 
fluxes equal to zero v = 0. ka 
1-4 =. 0) lee 
0 1 1 1] [kos 
ko4 


Non-trivial solution: under-determined system with rk(N) =0 


Case 1: Calculate the kernel matrix 

. . . . which is as an equation set: 

A kernel matrix only contains a linear-independent 

subset of flux modes, not all. Thus it is a non- x bikes =0 I 
i trix. Let’s start with th le: " 

unique matrix. Let’s start wi e examp eee te eee I 


N= bog ae oe We have 4 unknowns and 2 equations. So, we can 


oi set two unknowns to arbitrary values, e.g.: 


Nullity = Number of necessary basis vectors: a 


PN a With this choice, we get for the remaining unknowns: 
We want to know how many of the reactions r are 
not fully determined by the stoichiometric matrix 
with rank rk(N). > ko = 2 


> ko3 = 1 


F : and a second basis vector: 
1. Basis vector: 


2 
Nk, =0 l 
k — 
ku a ee 
1-1 -1 0] || _» 2 
0 1 1 1) |ks| 7 

kia Building a kernel matrix: 
which is as an equation set: 1\ /2 1 2 
1 1 1 1 
=> ky—ky—k3=0 I =K=(kike)=!] ol ti} }=lo 4 
ki2 +k13—ki4 =0 Il 1 2 1 2 


We have 4 unknowns and 2 equations. So, we can 
set 2 to arbitrary values, e.g.: 


ky=1l, ky=1 


Conservation relations for metabolites 

If the number of balanced metabolites exceeds the rank 
of the stoichiometric matrix, we can identify conserva- 
tion relations such as: 


NADH]+ [NAD] = const 
ATP] + [ADP] = const 


This means that some of the metabolites are dependent 
on other metabolites and do not act independently. Such 
conservation relations can be identified by calculating: 


N'Y =0 


A linear-independent subset of the conservation rela- 
tions build basis vectors in the matrix Y. The construc- 
tion is shown in Case Box 2. 


Case 2: Calculate the Y matrix 


A given stoichiometric matrix is: 


—1 
—1 
. 1 


1 
Nullity = Number of necessary basis vectors: 
m—tk(N) =4—1=3 


We want to know which metabolites can be seen as 
dependent on other metabolites in our system. In 
our case, we have only 1 reaction but 4 metabolites. 
Thus, we have only 1 independent metabolite. The 
remaining 3 metabolites depend on the selected 
independent metabolite. 


1. Basis vector: 


Ny, =0 

Yu 
[-1 -1 1 1] |?) =o 

Y13 

Y14 


is the same as the equation: 


> yu -Y2+y3+)14 =0 


We have 4 unknowns and 1 equation. So, we can 
set 3 to arbitrary values, e.g.: 


yi=1; yro=-1; y3=0 
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With this choice, we get for the remaining variable: 
=> yi4 =0 


and the non-unique elementary flux mode: 


2. Basis vector: 


is the same as the equation: 


=> —Yyo1 —y22 + y23 + y24 = 0 


We have 4 unknowns and 1 equation. So, we can 
set 3 to arbitrary values, e.g.: 


ya=1; y2=0; y3=1 
With this choice, we get: 
=> yo = 0 


and the non-unique elementary flux mode: 


4 
i 
i) 
I 
oror 


3. Basis vector: 


N'y, =0 

y31 
[-1 -1 1 1] |°2] =o 

33 

¥34 


is the same as the equation: 
=> —y31—y32+y33 + y34 = 0 


We have 4 unknowns and 1 equation. So, we can 
set 3 to arbitrary values, e.g.: 


y31=0; y32=1; y33 =0 
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With this choice, we get: 
= y34 =0 


and the non-unique elementary flux mode: 


a) 


Building a conservation relation matrix: 


1\ /1\ /o i oO 
-1)folfa 101 
Tele lad oll 16 4 6 
0] \o/ \4 0 01 


Each non-unique but independent conservation relation 
in matrix Y refers to one linear-dependent row of N. Two 


applications can be studied in the Examples 8 and 9. 


Example 8: Conservation relations 


From the simple hypergraph: 
A+B=—C+D 
we can create the stoichiometric matrix N: 


—1 

—1 
1 
1 


N= 


We have 3 linear-independent basis vectors (m— 
rk(N)) because the stoichiometric matrix N has 4 
species (rows) and one reaction (rank 1). 

In order to find the conservation relations, we cal- 
culate the conservation relation matrix using the 
transposition of N, denoted N’. The approach to 
calculating Y was shown in a previous section. 


N’y=0=[-1 -1 1 1])¥=0 


yi Y2 = 3 
1 1 OJA 
— Y= |-1 0 1/B 
(4-1)=3 0 1 0/1C 
0 0 I1/D 


The constraint matrix does not only constrain the 
network behavior, it also tells us whether a dataset 
is of good quality. The constraint matrix multiplied 
with the metabolite concentrations c(t) is always 
the same: 


Y" c(t =0) = const = Y¥" c(t) 


even if compared to the initial concentration c(0). 
This is useful when we want to perform a consis- 
tency check, scrutinized in detail in Example 9. 


Example 9: Consistency check 


We have a system with 4 metabolites and 2 reac- 
tions: 
D— Aand2A+B—>C. 
The initial concentrations are: 
ca(t = 0) =cp(0) =cc(0) = cp(0) =2. 
Two hard-working experimental biologists mea- 
sure (independently of each other) the concentra- 
tions at the same time point, f: 
Expl: ca(t) cp(t) cp(t) 1 and cc(t) = 3. 
Exp2: c4(t) =cg(t) =3 and cc(t) = cp(t) = 1. 
Can they be right? 


1) Stoichiometric matrix: 


If we apply the calculation of conservation rela- 
tions: 


T 1 0 oO -1 0 

wv-o-| 7, 9 1 a lr=(o] 

You have two possibilities. You can either set Y = 0, 

which is the trivial solution no one is interested 

in, or you find non-zero values for this matrix. Af- 

ter Gaussian elimination, we obtain the non-trivial 
solution: 


JI 
1 0) 0 -i1 y2 >= An ) 
Ty _ _ 
ae lee eat: “a 0 | yy | 
ya=a 
We can set two unknowns to the arbitrary values 


y4 = Ay, y2 = An. Consequently, we obtain for the 
remaining unknowns: 


y=a 
y3 = 2A, +A. 
yl a 
di 1 1, | 0 
y=| a | =A Jol+a]i] > y=]o 1 
2A, +A2 2 1 2 1 
MQ 1 0 1 0 


After we obtain the Y matrix, we can perform the 


consistency check with: 


N WY 
LES oR Y 
1021 f213) _ Yow 
er ae kee 
23 1 oe 

211 


or written differently: 

tory! =2+04+2-242=8; y?=0424240=4 
Exp] :y' =1+0+2-34+1=8 y=0+1+3+0=4 
Exp2 :y' =3+0+2-14+1=6; y*=0+3+1+0=4 


This indicates that Experimental Biologist 2 had a 
bad day and made a mistake. 


Conservation relations 


A=c¢-B (e.g. ATP= const — ADP) 
Metabolite A depends on B 


cS 
— = Left nullspace 
LS 
EIS 
i Elementary flux modes 
° 
2 A% BC (blocked) 
S 5 Right null- Neglectable reaction 
Ss space 
~/e 
= 


A> B > C (coupled) 
Reducible reaction 


Reactions r 


rk(N) r-rk(N) 
Figure 5. Nullspace scheme. If the column space is larger 
than the rank, we can obtain elementary flux modes. 
If the row space is larger than the rank, we can relate 
some metabolites to the metabolites being considered 
independent. 


Nullspace to identify conserved metabolites or coupled and 
blocked reactions: 

The nullspace represents linear-dependent vectors that 
fall into the origin of the independent vectors, as shown 
in Figure 5. An excess of metabolites can be described 
as dependent on the core metabolites and then the left 
nullspace disappears. An excess of reactions can contain 
either blocked reactions, which can be ignored, or cou- 
pled reactions, which can be pooled. In both cases, the 
column space can be reduced after the identification of 
the elementary flux modes. 

We learned that a linear-independent subset of elemen- 
tary flux modes is in the right nullspace NK, and that a 
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linear-independent subset of the conservation relations 
is in the left nullspace: 
Y'N=0' @N'Y=0. 

Both matrices contain basis vectors, from which in- 
finite many vectors can be generated. The matrix N is 
either multiplied by the kernel K from the right or by the 
matrix Y from the left, as shown in Figure 6. This makes 
sense because we know that the stoichiometric matrix N 
contains the information of the reactions/fluxes in the 
columns and the metabolites in the rows. During matrix 
multiplication, matrices from the left conserve the row 
space of N representing metabolites (conservation rela- 
tions) and matrices from the right conserve the column 
space of N, representing reactions (stationary fluxes). 
Remember that the number of columns of the left matrix 
must equal the number of the rows of the right matrix: 


= J 


with a,B, and y representing the number of columns 
and rows. 


Example 10: Demonstration of equivalence 
Show that: 
N'Y =0eY'N=0' 
is true for Example 9: 
The stoichiometric matrix N: 


—1 

—1 
1 
1 


N= 


and the conservation relation matrix Y are: 


1 1 0 
-1 0 1 
0 1 0 
0 0 1 


Case 1: N'Y =0 


1 10 

-1 01 
[SE att We 4 fg 

0 0 1 
=[-14+1+0+0 -1-0+1+4+0 0-1+0+41]] 
=[0 0 0 
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reactions 


conservation relations 

Y™N=0 N= 
Nis multiplied by Y" from the left 
> left nullspace 


elementary flux modes 


NK=0 
Nis multiplied by K from the right 
> right nullspace 


metabolites 


Figure 6. Nullspace scheme: The stoichiometric matrix is either multiplied by the left to obtain the conservation of 
metabolites or multiplied by the right to obtain conserved reactions. 


Case 2: Y'N=07 


—1 


1 
1 


(1--1)+(-1--1)+(0-1)+ (0-1) 
=| (1--1)+(0--1)+(1-1)+(0-1) 
(0--1)+(1--1) + (0-1) +(1-1) 


0 0 
1 0 1 0 
1 0 1 


<i +040 
= | SO Io 
06-1401 


I 
o8'9 


1.2 Mf Integration of experimental data 

The balance equation Nv = 0 is frequently underdeter- 
mined and rates have to be measured. A smart idea to 
separate the balance equation into a measured part and 
an unknown part: 


Nv=0 


(yn) = 
Vin 
Nw Vu +NinVin =0 


Nuvi = —NinVin 
—1 
Vy = N, (—NinVin) 


to obtain an inhomogeneous linear equation set. 

As you can see, we are interested in v, and need the 
inverse of the stoichiometric matrix. The inverse N,,! is 
often not available, and the measured matrix entries are 
hardly exact in reality due to measurement noise, which 
leads to inconsistencies. 

The basic integration of data can be studied in a well- 
posed Example 11 with Figure 7, and more generally in 
Example 12 with four typical scenarios. 


GicZ£4 Gep=22DHAP => PEP 


Figure 7. Reduced network of glycolysis from glucose 
to phosphoenolpyruvate with measured and unknown 
rates. 


Example 11: Glucose to phosphoenolpyruvic acid 


For this small metabolic network and model (Fig- 
ure 7), we assume that all components outside the 
system boundary can be neglected. The system 
boundary can be freely chosen and might, in this 
case, be the double-lipid layer of the cell membrane. 
We are interested in the steady-state concentration 
of glucose-6-phosphate (GP), dihydroxyacetone- 
phosphate (DHAP) and phosphoenolpyruvate (PEP). 
The balance equation for those metabolites in steady 
state is: 

GoP 

DHAP } =0 
PEP 
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If we insert the measured values from Figure 7, we 
obtain: 


1.4 
-1 -1l 0 v2 1 0 1 
=| 0 2 —-l y3}]/=-|0 0 @ 
0 0 1 V4 0 -il 
-1 -1l 0 V2 —1 
=| 0 2 —-l vy3}= | 0 
0 0 1 V4 1.4 


We now multiply each side by the inverse from the 
left to obtain the identity matrix: 


and now we know that the fluxes in Figure 7 are 
v2 = 0.3, v3 = 0.7, and v4 = 1.4. 
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Measurement scenarios 

Stoichiometric Network Analysis delivers physiological 
snapshots and is good for testing "if-then" scenarios. It 
also helps us to see how measured flux rates influence 
other rates via sensitivity analysis. However, the mea- 
sured rates are often not sufficient, and loops and alter- 
native pathways might cause problems. 

The dissection of our stoichiometric matrix into mea- 
sured and unknown parts reveals the problem of calcu- 
lating with an inverse matrix: 


Vy = N,! (—NinVin) 


As the inverse N'! is often not available and measure- 
ments are never perfectly correct, we use an approxi- 
mated matrix. The Moore—Penrose pseudo-inverse!7* 
N* is available for all matrices. Any mathematicians 
or physicists among you might enjoy a review of this 
method [2]; all others might just use the MATLAB com- 
mand B = pinv(A). Moreover, we use the kernel matrix 
K,, with related arbitrary vector a so that K,a = v. 

The final equation might be: 


HH 
Vy = —N UN mVn + K,a 


whereby we have 4 cases, which depend on the number 
of unknown reactions x and the number of metabolites 
m. The 4 cases are: 


e determined: rk(N,,) =x. All rates can be deter- 
mined. 


e underdetermined: rk(N,,) < x. Not all rates can be 
determined. 


e not redundant: rk(N,,) =m. 
e redundant: rk(N,,) < m. Inconsistencies likely. 


and are presented in the following matrix as an example: 


al v2 V3 V4 
N=j]1 -1 -1 OJA 
0 1 1 —-I1/B 


Example 12: Principal measurement scenarios 


Here are four different cases for stoichiometric net- 
work analysis. Red fluxes are measured. 

a) The first case describes a determined, non-redundant 
system. We know that the input of the systems 
needs to be equal to its output, thus vy = v4 = 2. 

As v2 = 0, and v2 +13 = vj = v4 = 2, we can easily 
calculate v3 = 2. 


T American mathematician Eliakim Hastings Moore (1862—1932). 
? Swedish geodesist Arne Bjerhammar (1917—2011). 
3 English mathematical physicist Roger Penrose (1931—today). 


0 
) — V4? 
sane B— 
NA 
V3? 


b) The second case describes an underdetermined, 
non-redundant system. We know that the input of 
the systems needs to be equal to its output, thus 
v] = v4 = 2. The fluxes v2 and v3 do not correspond 
to a single solution but the sum of their fluxes needs 
to be equal to v2 +73 =v) = v4 = 2. 


V2? 
— 4 4? 

A Bos 
V3? 


c) The third case describes a determined and redun- 
dant system, in which v; and v4 are inconsistent. 
The input v; = 2 does not correspond to the output 
v4 = 3. 


0 
i. 
+h 


V3? 


B-> 


d) The last case describes an underdetermined and 
redundant system, in which v; and v4 are inconsis- 
tent. 


V2? 
+A B->C 


v ut 
V3? 2 


Case 1: determined, not redundant (Ex. 12a) 


ii [Hi @ 
tly ap My 
-check: rk(N,,) =x =m = 2 —> determined, not re- 
dundant 


12/30 


-result: 


Ol. ahecggta bob Be), 
Ke=|9|: Ni = Ni =[5 oi}: 


Case 2: underdetermined, not redundant (Ex. 12b) 


-given: 


Vn = [vi] = [2]; x=3 
VI v2 V3 «V4 
1 -1 -1 0 
Ba=lg we=[y SI 
-check: rk(N,,) =m =2 <3 =x— underdetermined, 
not redundant 


-result: 
1 -05 0O 
K,=|-1|; N*=|]-05 oO]; 
0 —1 -l 
v2 1 1 
Vu = | V3 = —N*N Vm +K,a = 1)+]-lla 
V4 2 0 


Case 3: determined, redundant (Ex. 12c) 


-given: 
V1 2: 
Vn = |v2| = }|O}; x=1 
V4 3 


1 -l O —l 
nee Fh mel 
-check: rk(N,,) = 1 =x < 2 =m -—> determined, re- 


dundant 
-result: 


K, = [0]; Ni=[-0.5 0.5]; 


Inconsistent and redundant scenarios might be 
forced to consistency with ¥, = 4 = 2.5 (SSR, com- 
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promise between 2 and 3): 


vy = [v3] = NEN Vin +K,a 


2a 
1 -1 0 

=-[-05 03]| 0|+ [0a 
0 Lt =f) |54 


=2.5 


With weighted estimator processes, one could thereby 
consider the variance (measurement error). 


Case 4: underdetermined, redundant (Ex.12d) 


vy V4 v2 ~V3 
1 0 -1 -1l 
Wa=|q S| Me=[7 7 
-check: rk(N,,) = 1 <m=x =2 — underdetermined, 


redundant 
-result: 


fli. 4 |—0.25 0.25 : 
K, = El ; N= Ee a2, — get no rate; 


Inconsistent, redundant scenarios might be forced 
to consistency with | = f4 = 2.5 (SSR): 


Vy = "| = NEN Vn +K,a 
V3 
_ [0.25 0.25) |1 0} (2.5 ef 1 
~~ [-0.25 0.25] [o —1] [2.5] 7 |-1]° 
1.25 1 
= [135] + |[-i]2 
An underdetermined and redundant system can 


also have rates, which we can calculate. An exam- 
ple is rate v5. 
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1.3 Mi Constraint-Based Modeling Case 2: v3 =0 — v2 = 1 > mark point:(v2 = 1; v3 = 0) 
Flux Balance Analysis (FBA) [3] isa widely usedmethod Case 3: v3 = 5 Se 5 — mark point:(v2 = 533 = 5) 
within Stoichiometric Network Analysis to determine 
possible flux distributions. It can be conducted with 
a more general Constraint-Based Modeling approach 
(CBM), thereby allowing the integration of more infor- 1 
mation beyond the steady-state assumption. Possible 

applications are: 


V3 


e Prediction of phenotypes Nv 
e Prediction of mutant behavior. 


We will now step-by-step introduce FBA and CBM. In 

the beginning, the solution space can be considered un- __ - Reaction reversibility: 
constrained, meaning that any reaction can carry any Irreversible: v; >0 

flux. Then, we apply constraints, such as the mass bal- _ Reversible: — inf < v; < inf 
ance constraints imposed by the stoichiometric matrix S. 


YouTube: FBA e.g. v2, V3 > 0 (irreversible) 
The basic idea: start from all possible fluxes and incor- 
porate further constraints to limit network behavior to a v3 
smaller and thus more informative solution space. 1 
v Flux space 
J Constraint 2 V2 


Real solution 
- Boundary conditions: 


imposed by experimental setup or measurements, ¢.¢., 
Constraint 1 stationary continuous cultivation: 


Solution space —» not unique 


Vi uU=D (e.g. =0.1h') 
; vj; =; = const. 
The lower Vj. and upper Vinay bounds of a reaction are 


thereby also taken into account. - Enzyme capacities: 


€.8. Vi SVj max (maximal catalysis rate): 


Main applied constraints 

- Mass conservation: (Nv = 0) 

The material that is entering via the influx does not get 
lost or accumulate (steady-state assumption). The out- 
going fluxes need to balance the incoming fluxes for each 
balanced metabolite. However, the fluxes are often given 
in mole per time. Metabolites might be split during bio- 
chemical reactions and thus the respective molar fluxes 
multiply accordingly. But the mass remains the same. cS 


V4 
— >A aa - Thermodynamic constraints: based on chemical poten- 
v3 tials: 


A= v1 — v2 — v3 = 0 Vignin SVjS Vjmax 
> v2+V3=V1 
e.g. with: 
eg v=: 
Case 1: vy =0 > v3 = 1 > mark point:(v2 = 0; v3 = 1) 0.5<v <1 


0.5 


0.5 1* Vj 


The constraint-based model: 

We end up with a constraint-based model for which 
we can use an optimization algorithm—such as linear 
programming—to find the best possible flux combina- 
tion to maximize (or minimize) a given objective func- 
tion F = C'v. The coefficient matrix C thereby defines 
which flux(es) shall be optimized. This optimization is 
constrained by the system in the steady state itself Nv =0 
but also by all incorporated equality and inequality con- 
straints: 


maximize Chv=cyyy tonya +... + eye 
Vaim 
subject to Nv=0 
0<); ;some j 


vj =¥; =const. ;some j 


Vj,min < Vj Ss Vijjmax 7 some j 


We obtain a system of linear equality and inequality 
constraints. This also defines a cone in the v-space, as 
shown in Figure 8. 


Example 13: Apply the constraints and optimize 


We have the simple system: 


with: 
e mass constraint: vj = v2+ 3 
e reaction irreversible: v2 > 0,v3 >0 
e enzyme capacity: v) <1 


Solutions space? Unique solution? 


v3 


1 
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The idea behind this is that biological systems are 
evolutionarily optimized to maximize a certain be- 
havior such as: 


e maximal growth yield (microorganism) 
e maximal growth rate (microorganism) 
e maximal energy production (mitochondria) 


e maximal yield of product (genetic manipula- 
tion by human). 


We optimize the objective function F with: 


F=C'.y— max 


V1 
e.g. F=|0 1 0] V2 
V3 


— max 
F=y2 > max 
v3 


Optimal solution 


i 


1 V2 


The solution is obtained by linear optimization, 
which is fast and reliable. We use software for it, 
although we have the problem that the algorithms 
usually compute only 1 optimal solution. Other 
equally optimal solutions might exist and need to 
be determined by corresponding algorithms (like 
flux variability analysis or random sampling). 

We could now also include an enzyme capacity e.g. 
v2 < 0.6: 


v3 


V2 


Then, the optimization does not result in a single 
solution. Instead all values 0 < v3 < 0.4 are possi- 
ble. 


What would happen if we impose v3 = 0.5? 


>yy=w+73=064+05=11<1 % 
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V3 V3 v3 
constrain 
1) Sv=0 optimise 
2) Vimin < Vi <Vi,max max v3 


v4 


unconstrained constrained 
solution space solution space 
V2 V2 Vo 


V3 v3 


V3 max 


V2 


4 


max catalysis rate 
y Vo +4V3=V4=4 max V2 


of the enzyme 


Figure 8. FBA and constraint-based modeling. 

Top: Stoichiometric Network Analysis might leave a large solution space. Due to the mass balance constraints 
imposed by the stoichiometric matrix S and the upper and lower bounds of a reaction, one obtains a constrained 
solution space (red). This space is used by optimization algorithms to find one optimal solution for biomass (for 
example). 

Bottom: One constraint captures the maximum turnover velocity of an enzyme V3 max < 1, which is combined with 
another constraint v; = | from measuring this rate. We know then that the sum of the v2 and v3 fluxes must be 1 as 
well (dark lines). The red area shows the constrained solution space for an alternative vj constraint: 0<vj <1. 
After maximizing v2, it becomes clear that the enzyme responsible for v3 has to be inactive in order to maximize v2. 
Adapted from [3]. Copyright © 1969, Nature Publishing Group. 


Example 14: FBA example that the influx is fixed: 
Given a simple network: yet 
a v2 We can further assume that some fluxes depend 


on enzymes with limited capacity v3 < V3 may and 
assume that one flux such as through the biomass 


V3 . . 
reaction has to be maximized: 


vy 
max F =C'v= [0 1 0] v2} =vo 


: ! V3 
S=vj—v2—-v3 =0 


Vy =V2 +3. The optimal solution is vz = 1. 


with mass conservation Nv = 0 leading to: 


We assume that all reactions are irreversible and 
the respective rates are thus positive: 


V2, V3 2 0 


We can introduce boundary conditions, such as 


Aout Bout Pout Eout 
fe) eee Pal o_o 1h al 
1 ae 
Vg 
AC MP 
Vo D E 
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Aout Bout Pout Eout Aout Bout Pout Eout 
il 1 val | |Reeee | eee i 
5+ Bat > Bai 
AES ie | 
| A=>C =P | | APC 
1 cag H ‘ v. 
<p 7g i DOE | 


maximize Pout with fixed Aout and no Bout. We obtain two extreme linear-independent solutions: the first solution 
with direct flux from A to B and the second solution indirect via C. The last scheme represents infinite solutions 
with different ratios of the two extreme pathways from A to B. 


Example 15: Opt. flux balance for P.x_ without Bex 


We use the metabolic network shown in Example 
5 and Figure 2. We search for the optimal flux 
distribution for blocked B,., and maximized Pow 
aS shown in Figure 9 with objective function F = 
C’ v: 


maximize C’ v = (0,0,1,0,0,0,0,0,0,0)v = v3 
With constraints: 
e Nv=0, according to Example 5. 


e Set fluxes v; = 1 (uptake) and v2 = 0 (no up- 
take) according to the available measurement 
information. 


e Lower bounds vin = (1,0, 0, 0,0, 0,0, — inf, 0,0) 
Upper bounds Vmax = (1,0, inf,..., inf). 


The resulting infinite solutions (linear combina- 
tions from the first two solutions) are illustrated in 
Figure 9. 

Always check uniqueness. 


Optimization after knockout: 

Another optimization example is shown in Figure 10. 
The central metabolism across different related organ- 
isms is especially well-conserved while pathways with 
less evolutionary pressure might be less optimized. One 
can also assume that organisms optimize their program 
after knockout experiments, as shown in Figure 11 and 
studied e.g. by Segre et. al. [4]. 


Figure 10. FBA constrains the solution space. If we only 
consider fluxes vj and v2, we can either optimize for v; 
or v2 alone, or we can optimize for the sum of the fluxes 
of vj + V9. 


FBA of knockout 
(optimal) 


V2 
FBA of wild type 


(optimal) 


Objective function 


FBA of knockout 
inane ies max CTv = c,v1 + C2v2 


(suboptimal) 


Feasible space of 
wildtype 


Feasible space of 
knockout 


Figure 11. Evolutionary adjustment after knockout of 
a gene and its respective reactions. After a knockout, 
the feasible space is reduced and a suboptimal solution 
might be present. With time, the evolutionary optimiza- 
tion step b results in a new optimal flux distribution in 
the reduced solution space. Figure guided by [4]. 
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1.4 MH Problems & discussion — Unique value of the objective function (‘growth’) 
False positive and false negative prediction of gene/reaction - Existence of infinitely many optimal solutions 
knockouts with optimal value of objective possible. 

False positive prediction (the organism does not live in 

reality after the knockout although the model states that e Without incorporating further constraints often 

it should): poor performance in predicting flux distributions. 


e Kinetics / regulation not considered in the model 


e Neglected side-effects, e.g. toxic intermediates. 


False negative prediction (the organism does live in re- 
ality although the model states that it should not): 


e Proof for incorrect network structure — Quality 
control & iterative model refinement possible. 


Limited applicability 


e Steady-state assumption — no dynamics consid- 
ered, e.g. accumulation of metabolites 


e Focus on mass transfer — barely applicable to cel- 
lular information processing. 


Objective function 


e Strong dependence of results on choice of objective 
function 


e Use of ‘natural’ objective functions such as growth: 


- Not applicable to all organisms (e.g. cells in 
multicellular organisms — cancer) 


— Not applicable under all conditions (e.g. after 
perturbation of an organism) 


e Alternative/ conflicting numerical approaches for 
optimization 


e Alternative approach for fluxes: principles of flux 
minimization (~ effort for establishing a network). 


Alternative optima 


e Linear programming problem: finding a solution 
can be guaranteed 


1.5 Mi Metabolic Flux Analysis 

Metabolic flux analysis (MFA), together with C-labelled 
isotope experiments, improves the characterization of 
fluxes and is called isotope-based flux analysis. Different 
experiments using labelled isotopes are summarized in 
Figure 12 together with the two possibilities of stoichio- 
metric network analysis without isotopes. We do not go 
into detail but you are encouraged to study Figure 13 
and the referenced paper. 

Metabolic flux analysis gives us deeper insights into 
metabolic processes. See also the following reviews 
[5, 6]. The models remain small. See also Figure 14 
for a comparison with the previously described Flux 
Balance Analysis (FBA). FBA returns a larger solution 
space, which is more helpful for large metabolic net- 
works. However, these solutions might contain different 
flux distributions that can all maximize the assumed cel- 
lular objective, i.e. growth or biomass production [5]. 
The different principles of MFA and FBA are reflected 
by two major optimization strategies in computational 
biology shown in Figure 15. Either one imposes the mea- 
surements and either tries to reduce the sum of squared 
residuals, or one defines boundaries in which the func- 
tion is forced to stay. 


Isotopic Steady State 


NO TRACER YES NO 
C *8C-NMFA 
Flux 
o 18C-labeling 
s a 
no > 
> 
ne) 
oO ? 
ao) time —7> 
eH E 
oO 13¢C.. 
5 LEGEND C-DMFA 
a 
oO 
2 9 Flux 
—  |lsotopic Labeling 
3 Measurement 
time ——~> 


Figure 12. Isotope experiments for dynamic and steady- 
state MFA. The blue line represents metabolites and not 
fluxes. Source [5]. Copyright © 2015, Oxford University 
Press. 
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Cell culture 
Labeled Feed Measure 
Substrate => “] 
Simulate Metabolite labeling 


Compare | 


Lack of fit 


M2 M3 


I simulated 
Wi Measured 


Adjust fluxes 


Figure 13. Isotope tracing and '?C-metabolic flux analy- 
sis. "In simple metabolic networks, each pathway pro- 
duces a unique labeling pattern in the final product, and 
the resulting mass isotopomer distribution provides a 
direct measure of relative flux in the network. Mass iso- 
topomers are molecules with the same chemical formula 
but different molecular weights due to varying incorpo- 
ration of heavy isotopes. They are denoted MO, M1, M2, 
etc., in order of increasing weight. In complex networks, 
a computational model is applied to determine fluxes 
by minimizing the lack of fit between simulated and 
measured labeling patterns at multiple pathway nodes. 
The flux parameters in the model are iteratively adjusted 
until the optimization converges". Direct quotation: [7]. 
Licence: CC BY-NC-SA 3.0. 
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Flux balance analysis Stoichiometric flux analysis Isotope-based flux analysis 
(FBA) (MFA, DMFA) (*3C-MFA, *7C-NMFA, 12C-DMFA) 


Large-scale model Small-scale model Medium-scale model 
(e.g. genome-scale) (simplified) (central metabolism) 


| } } 


Maximize Vgowen Minimize SSR Minimize SSR 


s.t. Sxv=0 Sxv=0 Sxv=0 
LB<v< UB Rxve=r fisotopomer(%,V) = 0 


Flux solution space Simplified 


Metabolic flux map Detailed and precise 


Metabolic flux map 


Figure 14. Overview on methods. Each of these methods uses mathematical optimization to approach the measured 
data. S x v = 0 corresponds here with Nv = 0 and R x v = r corresponds here with N,v, = —NmVm. FBA: fluxes are 
balanced until they remain between defined lower (LB) and upper (UB) boundaries. MFA: numeric adjustment 
to minimize sum of square residuals. DMFA: dynamic metabolic flux analysis. SSR: sum of squared errors of 
prediction, either without isotopes (Stoichiometric Network Analysis) or with isotopes (isotope-based flux analysis). 
Source: [5]. Copyright © 2015, Oxford University Press. 


Measurement-based fit (MFA) 


Minimize SSR 
@ 


Minimize SSR 


Constraint-based fit (FBA) 


Maximize gain 


Figure 15. Conceptual illustration of fitting a function 
to measurement data vs. constraining it: Functions can 
be fitted in at least two ways. The most frequent is the 
measurement-based fit. Mean values (red dots) are im- 
posed and a function (blue line) shall come close enough 
to the function in order to fit, usually by minimizing the 
sum of squared residuals (SSR). The second possibility 
is the imposing of constraints. The function has to stay 
within the bounds. The latter can also be based on mea- 
surements, e.g. by using mean +/ standard deviation as 
the maximum and minimum values. 
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2. Exercises 


®@ Stoichiometric matrix N 
a) Formulate N for the following biochemical system: 


y:E+S=ES 
w:ES>E+P 


b) Formulate N for the following biochemical system: 


y:£+S=ES 
w.:ES>E+P 
vy3:E+I=E!I 


v4: ES+1 = ESI 
vs :EI+S = ESI 


c) Formulate N for the following biochemical reaction 
chain: 


A=B3C2DE=A 


d) Extract the biochemical reaction of metabolites Sj 
to Ss from the following stoichiometric matrix N: 


1 -l1 0O 0O 
0 —2 0 O 
N=|0 1 -1 0O 
0 3 -1l 0O 
0 0 2 -!1 


@ Flux Balance Analysis (FBA) 
The following biochemical equation system is given: 


yi GAA @ @) 
wW:8—->8B pepecexden, 
v3:A3C A B: 
vy: BoC 174 
vy:B->D oie 
w:C7 So ‘ 
v7iD-@ 


Under the steady-state condition, the following rates 
were measured: vj = 1, vo = 1, and v7 = 0.5. Calculate 
the remaining rates v3 to vs, applying the methodology 
of Flux Balance Analysis (via N and by dividing the 
balance equation into known and unknown parts)! To 
make your life easier: 


4 2 OD 0 -1 0 0 0 
0 =f at Oy) oa =f oO <1 
bt © <1] “|e oO oO 4 
0 1 0 a ae 
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@ The kernel matrix K 
a) Calculate the kernel matrix K of the following stoi- 
chiometric matrix N= (1 1 —2.) 


b) Determine dead ends and unbranched reactions 
pathways, given the following kernel matrix K: 


—1 
0 
—1 


0 
—1 


—1 
—2 


Or onoror 
lio 
ne 
FoNnoocOoONS 


@ Constraint-Based Modeling of an example network 
An example network is composed of substrates (Sj, Sz), 
intracellular metabolites (A, B,C), and the biomass (X). 


e The following reactions v; to v6 take place: 


ypiS; 3A 
2:52 9B 
yw3:C73X 
vw:A>C 
v5: BoC 
w:A=B 


e Reactions vj,v2,v3,V4,V5 are irreversible. 


e For ve, a maximal enzyme capacity is given: v6 < 


V6,max+ 
e Due to the experimental setting, v3 is fixes to v3 = D. 


Questions: 


a) Draw the biochemical network based on v; to v6 
and the irreversibility information. 


Ss 
wa 


Formulate the balance equations for the intracellu- 
lar metabolites A, B, and C in steady state. 


a 
wa 


Replace v3 by D in the resulting equations. Keep 
v, and vo as variables and solve for v2, v4, and vs. 


= 


Apply the irreversibility information of v; to v5 and 
extract therefore (several) inequality constraints 
for the remaining variables v; and v6. 


e) Mark the possible solution space using a v6 over vy 
plot using the obtained inequality constraints and 
the enzyme capacity V6 max. Distinguish thereby 
two cases: 

1) V6,max =D 
2) Vo,max < D 
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f) Give the (unique) solution for the case vg > D while 
optimizing for v6 — max (Remember that when 
not otherwise indicated, the maximization of a re- 
versible reaction implies the maximization of the 
production of the metabolites on the right side of 
the equation). 


Notes 
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3. Solutions 


Do not betray yourself! 


@ The stoichiometric matrix NV 
Task a) Formulate N for the following biochemical sys- 
tem: 


y:E+S=ES 
w2:ES>E+P 
The solution is: 
al i) 
—-1 1l1/E 
N=]-1 O|]S 
1 —Il1|ES 
0 1/P 


Task b) 
tem: 


Formulate N for the following biochemical sys- 


y:£+S=ES 
w2:ES>E+P 
v3:E+I=EI 
v4: ES+I= ESI 
v5: EI+S=—ESI 


The solution is: 


1 
0 
0 oOo -1 -1l Oj; 7 
0 0 1 0 -—-1) EI 
0 oO O 1 1 |ESI 


Task c) Formulate N for the following biochemical re- 
action chain: 


A=BH=CH]D=E=A 


The solution is: 


[-! 0 oO O Le 
nv=|1 —1 6 0 OB 
“IO 1 <1 0 oOl¢ 
0 O° tf. =f O10 
0 0 0 1 -IL/E 


Task d) Extract the biochemical reaction of metabolites 
5S; to Ss from the following stoichiometric matrix N: 


V1 v2 V3 V4 
1 -1 O 0 |S, 
0 —2 O 0 | So 
a Oo 1 -1 O1]83 
0 3 -l OS, 
0 0 2 —I1|Ss5 
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The solution is: 


yi: 03> S$ 

v2 2S, +2S2 + $34+35S4 
v3 :53+S4 > 285 

v4 295 => 


@ Flux Balance Analysis (FBA) 
The following biochemical equation system is given: 


VIIG OA @ Q@) 


2:03 >B eo 
v3:A>C A B' 
vw: BoC Le 4 
vs:B—+D ae 
w:C3S 
wi: D7A@ 


Under the steady-state condition, the following rates 
were measured: vy; = 1, v. = 1, and v7 = 0.5. Calculate 
the remaining rates v3 to ve, applying the methodology of 
flux balance analysis (via N and by dividing the balance 
equation into known and unknown parts)! To make 
your life easier: 


-1 0 0 0 1 0 @ 0 
0 20 0, et Sb 0. 4 
1 1 -1} ~{0 0 0 1 
0 1 0 — oo oe 


The solution is: 
From the biochemical equation system we get: 


V1 v2 V3 V4 V5 V6 V7 
-1 0 0 0 0 
N = 


ocoocr 
ooro 

— 

i 

oO 

| 

—_ 

oO 
yawns 


1 
Doe -t © oo oO ; 0 
Ot © -1 +1 o o]|'|..)0 
00 1 1 0 -1 0 aml feat (te 
Co © oO Tf @ =i) |= 0 
V6 
0.5 


Removing columns that corresponds to v;, v2, and v7: 


10 0\ /, 
0 -1 -1 O]fu}J__fo1 off, 
1 1 0 -1] [vs 00 0} \o. 
0 1 0} \v 0 0 -1) 
1 
7 1 
= 
0.5 


Let us bring the flux vector to the right and solve the 
system for v3, v4, v5, and v6: 


v3 -—t 02 © © I 
ve} {0 -1 -1 0 =i 
ve} | 1 1 0 -1 0 
V6 0 0 1 O 0.5 

=f OG O ©) 7-1 

ta =f @ St) pi 

= 10° O O- A 0 


+1+1-—0.5 


1 
0.5 
0.5 
1.5 


II 
ee 

ae 
un 

jo) 

alo 
S 
n 


Drawing: 


measured 


J@ YO” 


A B 
| vA os — 

Cc" D 

f25 |@ 


i The kernel matrix K 
Task a) Calculate the kernel matrix K of the following 
stoichiometric matrix N: 


The solution is: 
With 
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we calculate: 


© 1kj, + lkj2 — 2k;3 =0 


In order to solve the system, we first need to determine 
the nullity: 


r—rk(N) =3-1=2 


We have an underdetermined system and 2 kernel vector 
are necessary to solve the system: 
2 assumptions are necessary, ki2 = co and kj3 = c3: 


ky + leg — 203 = 0 
4 ki = 2c3 —c2 


2c3 — C2 
6 ki = C2 
C3 


Let us choose 2 non-identical values for cz and c3: ¢.g. 
C2 = 1 C3 = 0 


2 
k= 10 
1 
thus, 
-1 2 
K=j|{ 1 0 
0 1 
Task b) Determine dead ends and unbranched reac- 


tions pathways, given the following Kernel matrix K: 


—1 
0 
—1 


—1 
—2 


Om onoror 
loo 
a 
FHFonocono 


The solution is: 
Dead ends: v4 
Unbranched reaction pathways: 


© V1, V3, V7 


© V2, V6 
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@ Constraint-Based Modeling of an example network 
An example network is composed of substrates (Sj, Sz), 
intracellular metabolites (A, B, C) and the biomass (X). 


e The following reactions v1 to v6 take place: 


yiS,3A 
v2:52 >B 
v3:C3X 
w:A>C 
v5: BoC 
vw:A=B 


e Reactions v1,v2,v3,V4, V5 are irreversible 


e For ve, a maximal enzyme capacity is given: v6 < 


V6,max 


e Due to the experimental setting, v3 is fixed to v3 = D 


Questions: 


Task a) Draw the biochemical network based on 1, to 
ve and the irreversibility information. 


V4 
ae 


V5 
S2—_~> B 


Task b) Formulate the balance equations for the intra- 
cellular metabolites A, B and C in steady state: 


A=v,i—va—v6 =0 
B=v.—vs +v6 =0 


C=v4tv5—v3=0 


Taskc) Replace v3 by Din the resulting equations. Keep 
v, and vo as variables and solve for v2, v4, and vs: 


A:v4=Vi —V6 
C:vatv5—-D=0 
6 v1 —Vv6+v5 —-D=0 
&v5 =D+Vv-V1 
B:v2-D-—vot+v+v =0 
2 v= D-yv, 


Taskd) Apply the irreversibility information of v; to vs 
and extract from there (several) inequality constraints 
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for the remaining variables v; and v6: 


yy >0 

v2=D-vy>0 
>y<D 

v3=D 


V4 =Vi —V6 > 0 
V6 SV 

v5 = D+v6-—vy > 0 
>Vv>v,—D 


Taske) Mark the possible solution space using a v¢ over 
v1 plot using the obtained inequality constraints and the 
enzyme capacity V6 max. Thereby distinguishing thereby 
two cases: 


Taske.1) V6 max = D 
V6 V6 
v,=0 v,<D D 
V4 Vy 
V6 V6 
D D 
V6 Sy Ve 2V,—-—D 
D V4 D v4, 
—D. 
Help: 
Case 1: vy} =D 
V6 > 0 (fix the point) 
Case 2: vj =0 
v6 > —D (fix the point) 
Taske.2) V6max <D 
Ve,max >D V6,max <D 
V6 V6 
D 
Vemax 
0 D D 
Vi Vi 
—Vemax 
—D 
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Task f) Give the (unique) solution for the case vg > D 1 
while optimizing for vg — max. 


Vy 
ss 


V6 


Flux cone of an example network 
From the previous system, we know that: 


ere LL 


1. vj to 5 are irreversible 1 


2: | V6 < V6,max 


3. v3=D 
vy 20 vy >0 
v2 20 v2 >0 


y3 [OS y4+M 50 
vy >O0 vyy—vs >O0 
vs 20 vatvs 20 


al 
CA to 8 —<t @ hi)" 
else lO-1 0 G8 <f. 12)" 
CC OO 1 i 1 M 
V5 
V6 


with quasi-steady-state assumption for A, B, and C (0 = 


Nv): 
V1 1 0 O 
vo 01 0 
wut {11 0] /("! 
| TL Oe = . 
V5 06 i 4 : 
V6 0 0 1 


Represented in 3D: 


Notes 


Chapter 3: The magic of change and how to find it 


Thomas Sauter, Marco Albrecht 


Motivation 
The world around us is connected and changes all the time. But do we know where things are going? Are they 
on a path to endless infinity or do they find a condition where they are in balance with all their neighbors? In 
reality, we often see mysterious black boxes, and it is not clear why we get a particular outcome for a specific 
input. Sometimes the result depends on the history of movements. If this is the case, we see different final results 
for the same condition. In this block, we strive for answers in the realm of systems science. We will learn to 
model so-called systems in time, and we will ponder on how to bring them to desired states as quickly and 


precisely as possible. Likewise, we will learn how organisms develop an effective control of various regulatory 
biochemical processes forged by the forces of evolution. On this journey, we will also understand the beauty 
of connected things and what they teach us when we address biological problems, a perspective that make us 


better experimental biologists too. 
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1. Lecture summary 


In the previous two chapters, we studied systems in 
equilibrium and learned about powerful tools for the 
modeling of metabolic networks. However, the process 
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Green fluorescence 


Extracellular TMG (uM) 

Figure 1. Example where the measurement seems myste- 
rious. The GFP signal depends on the non-metabolizable 
lactose analogue thiomethyl-galactoside (TMG), but 
whether we increase TMG or decrease TMG remarkably 
changes the outcome, also referred to as hysteresis or 
multistability. What is behind this? See Figure 2. Source: 
[1]. Copyright © 2004, Macmillan Magazines Ltd. 


of coming to a steady state has been assumed heretofore. 
In this chapter we will introduce some of the theoretical 
basics to treating and understanding biological systems 
which are dynamically changing over time and thus are 
no longer in steady state. 


1.1 & Examples of non-linear dynamics 

Let us start with a dynamic experiment on the popula- 
tions of cells in Figure 1, which also shows that reality 
cannot always be described with basic linear tools. In 
this example, we see that the expression of genes relevant 
for the lactose metabolism (GFP signal) does not merely 


https://doi.org/10.11647 /OBP.0291.03 
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depend on the availability of the non-metabolizable lac- 
tate analogue (TMG). It also depends on the previous 
amount of TMG in the cellular environment. If 30 ug 
TMG were present at the beginning and this concen- 
tration was further decreased, gene expression for the 
lactose metabolism is measurable down to 5 ug TMG. 
But if, instead, the TMG concentration was increased 
starting with a low value of 2 ug TMG, gene expression 
will not set in until 15 ug TMG. Whether there will be 
lactose metabolism or not in the range from 5 to 15 pg 
TMG consequently depends on the history of the system, 
a phenomena known as hysteresis. Hysteresis can also 
be seen as a bistable state which can result in discrete 
switch-like outputs from continuous inputs. 
Hysteresis increases the resistance to noise as it requires 
higher values of the input in order to switch to a par- 
ticular state, as compared to the input needed to stay 
in a state. The switch also means that a transition is 
not continuously reversible after it has been triggered. 
The behavior is often hidden and encoded in unknown 
connections of biological elements. Here, the regulatory 
principles behind the observation in Figure 1 are well 
understood and illustrated in Figure 2. 

Other examples of hysteresis can be studied in Figure 
3 and 4, where the regulatory network structure (mu- 
tual molecule dependency) and the time behavior are 
depicted side by side. Take some time to understand 
how these schemes relate to each other. In Figure 4 we 
have an example from stem cell biology with the home- 
obox protein Nanog. Regulatory interconnections are 
not restricted to molecular biology. We might also have 
physiological and anatomical systems with regulatory in- 
terconnections. One example is the control of the blood 
sugar level by the liver and pancreas, illustrated in Figure 
5. Another anatomical example can be found in Figure 6 
on the adaptation of the eye. The restriction of the pupil 
compensates for disturbances with different light inten- 
sities. The control of this system can be drawn as a block 
diagram (see next section and Figure 7). Such regulatory 
circuits are also used in synthetic biology, within which 
scientist constructs or re-design existing biological sys- 
tems for useful purposes. Let us explore the impact of 
some basic network motifs on the dynamics of a system 
next. 


Chapter 3: The magic of change and how to find it — 3/56 


- —>> 


(Octd).. __ {Sox2) 


| 
| 
+ | me  & 
metabolism | yee : - is Se, 
CRP Lacl tae | = J 

l + + 
: + 
| 

GFP I 
| © = 
| 


Stem cell 
state 
C4 


C2 Differentiated 
cell state 


Nanog 


lacA 


lacy 


Prat HeRed ‘i Signal B 
Figure. 2. Underlying the hysteresis in Figure Figure 3. Stem cell differentiation network. Bifurcation diagram 
1 is the specific regulatory network of the Lac of transcription factor Nanog expression as a function of the 
operon model. LacY fascilitates the influx of levels of signal B. Solid lines represent stable states, dashed lines 
TMG, which in turn inhibits the repressor. This represent unstable states. The bistable region is shaded in gray. 
again increases again the production of LacY, Bifurcations occur at signal intensities cl and c2. Source: [2]. 


leading to a positive feedback loop and thus Copyright © 2015, The Company of Biologists Ltd. 
enabling bistability. Reproduced from: [1]. 
Copyright © 2004, Macmillan Magazines Ltd. 


— Nanog PrE-like state 
— Gata6 


Epi-like 
state 


> 2 
2 
3 ICM-like state £ 
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Zz \ e 
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‘ PrE-like 
State 
Fgf4 Time 


Stem cell state Oct4 PrE-like state 


or 
I 
g | 
6) 
+ 
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PrE-like 
state 


Oct4 Time 

Figure 4. Stem cell differentiation network. A progenitor cell can differentiate into an epiblast cell (Epi), primitive 
endoderm (PrE; generating extra-embryonic membranes) and trophectoderm (TE; giving rise to extra-embryonic 
tissues), depending on the culture environment. This is a bifurcation diagram of transcription factor Nanog 
expression as a function of the levels of Fgf4 or Oct4 with different network motifs. Solid lines represent stable states, 
dashed lines represent unstable states. The bistable region is shaded in gray. TE-like state: stem cell switch with 
Nanog is off. PrE-like state: Gata6 expression is present but Nanog transcription is efficiently suppressed. ICM-like 
state: co-expression of Nanog and Gata6. Epi-like state: Nanog expression. Source: [2]. Copyright © 2015, The 
Company of Biologists Ltd. 

Think about the following question: Could one use a linear regression approach between Nanog and Fef4 or Oct4? 
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| Glucagon | 

Figure 5. Blood sugar regulation. If the sugar concen- 
tration is too high in the blood, the pancreas releases 
insulin. Insulin makes tissue cells and the liver store 
or consume more sugar. If the sugar concentration is 
too low in the blood, the pancreas releases glucagon to 
cleave glycogen with subsequent up-regulation of blood 
sugar. Licence: CC BY-NC-ND 4.0 (Fair Use). 


iris muscle (actuator) 


(control) 


brain (control) 


set point 


—>o> 


actual ar 


correction factor actuator control route —_ measuring unit 
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Figure 6. Adaptation of the eye to adjust to various light 
intensities. The light source disturbs the retina. The 
retina detects the light intensity and gives the actual 
value to the brain. The brain compares this value with 
the setpoint. If the light intensity is higher than accept- 
able, the brain gives the signal to reduce the light. If the 
light intensity is too low, the brain provides the signal 
to increase the light intensity. The actual adjustment is 
achieved via the pupil constriction by the iris muscle. 
Credit to Greyson Orlando, wikimedia. Fair Use. 


1.2 H The impact of network motifs 
We distinguish two main types of loops in networks: the 
feedback loop and the feed-forward loop (FFL). 


Definition 1. Feedback: Processes which influence their 
own cause or input. 


Feedback is a type of closed loop regulatory unit, as 
shown in Figure 7. Such control structures are widely 
used in engineering in many applications too—for ex- 
ample, to keep the output y at the desired level. 


r e u 
Sed 


Figure 7. Block diagram of control loop (negative feed- 
back). 


y 


A controller C uses the error e, which is the difference 

between the desired steady state r (set point) and the 
system output y, to adjust the control u. The control u 
acts on the system states x. 
The feedback loop can encode negative and positive feed- 
back, and occurs often in biology, as shown in Figure 
10 with some examples. Negative feedback occurs fre- 
quently among repressors, which repress their own tran- 
scription. This allows a strong maximal promoter activ- 
ity with a fast response, but at the same time strong 
repression once the desired product’s steady state is 
achieved. 


A variety of examples of network motifs have been iden- 
tified in biological systems and analyzed. We can distin- 
guish motifs with 1, 2,3 and more than 3 nodes. 

One node: In the E. coli. transcriptional network, 85% of 
the auto-regulative self-edges are negative and 10% are 
positive [3]. There are many cases where all edges start 
and come back to the same node. 

Two-node networks with feedback occur in e.g. develop- 
mental networks and can be seen as memory elements. 
Once this motif is triggered, a mutual inhibition or acti- 
vation is conserved and remains independent of further 
input under certain circumstances. A decision for the 
subsequent development has been made. For an exam- 
ple, see Figure 26 at the top with the toggle-switch motif. 


Coming to three-node network motifs, feed-forward loops 


are more than 30-fold enriched in the E. coli transcription 
network than random networks. Feed-forward loops can 
be split into coherent FFL, where the indirect path has 
the same overall sign as the direct path, and incoherent 
FFL, where the indirect path sign is the opposite to the 
sign of the direct path, shown in Figure 8. 

Type I of both coherent and incoherent FFL are much 
more enriched in the E. coli network than the other types. 
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Figure 8. Feed-forward loops (FFL) split from one node 
and fall together at another node. Thereby information 
is transmitted in the forward direction via two parallel 
branches (pathways). The multiplication of the signs 
along the interactions of each branch gives the overall 
sign of a specific branch. A coherent FFL results in an 
end node which receives either only positive or only 
negative signals. An incoherent FFL end node receives 
both sign types. Source: [4]. Copyright © 1969, Nature 
Publishing Group. 


The dynamic consequence of the Type 1 feed-forward 
loop with AND gate is shown in Figure 9 and Example 
1. The output Z requires both activated X and activated 
Y. Once X is activated, Y begins to increase and is consid- 
ered active as soon as it trespasses a certain threshold. 
If Y is active, it does not mean that Z is activated, as 
it requires maintained activation of X at the same time. 
This motif can be seen as a noise reducer in fluctuating 
environments. Only a signal that is present over a longer 
period can trigger a response, which arises then after a 
certain delay. An interruption of the input leads to in- 
stantaneous shutdown of the signal without delay. The 
Type 1 coherent FFL with an OR gate would activate 
Z instantaneously, and Y would have no relevance for 
the activation. Instead, the OR gate would make Y into a 
sustained activator of Z until it falls below the threshold. 
This motif would activate Z immediately and keep the 
response activation longer even if the original input was 
removed. The Type 1 incoherent FFL with AND gate 
would create a peak. We would see short activation of Z 
during long time-input activation. Once the level of Y 
trespasses the threshold, it inhibits the activation with 
different levels of strength, determined by the repression 
coefficient. The Type 1 incoherent FFL with OR gate 
would be activated instantaneously, but would acceler- 
ate the off-signal in Y. The acceleration increases with an 
increased repression coefficient. 
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Reminder: Logical operators such as OR and AND 
help us to define rules: 


A Bs AANDB | AORB 
0 O 0 0 
0 1 0 1 
1 0 0 1 
1 1 1 1 


Figure 9. The coherent Type 1 feed-forward loop. The 
output Z shows delay after S, addition in the input, but 
no delay after S, removal. The network motif is a sign- 
sensitive filter, which responds only to persistent stim- 
uli. Source: [4]. Copyright © 1969, Nature Publishing 
Group. 


Example 1: Simulation of coherent Type 1 FFL 


Referring to the dynamic feed-forward loop with 
AND gate and its dynamic behavior shown in Fig- 
ure 9, we show here the underlying computational 
model which allows us to obtain such time course 
plots. This requires the formulation of balance 
equations for each of the modeled molecule con- 
centrations (states), which include mathematical 
terms describing how the different effects change 
that specific concentration. This concept of balance 
equations will be explained in more detail and with 
more examples later in this chapter (Section 1.4, 
page 9) and in Chapter 4. So consider coming back 
to this example after reading more about balance 
equations. 

This balance equation can then either be solved 
analytically—which we will discuss later in this 
chapter—or with the help of computer simulations. 
But these equations can also be solved with the help 
of computer simulations. Such simulations are usu- 
ally used if the equation system is too complex to 
be solved analytically. Below, we give the com- 
puter model in the IOM toolbox format [5]. Other 
tools use similar but slightly different formats. This 
model is used within computational frameworks 
for numerical integration. This allows us to obtain 
usually very accurate approximations of the sys- 
tem behavior for a specific set of initial conditions 
and parameters. The parameters in this example 
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were arbitrarily chosen. The activation cutoff for Y 
matches the dashed line in Figure 9. Only above 
this threshold, Y transmits a positive signal (Yeff). 
The chosen initial conditions are matching the val- 
ues of the states at t = 0. 


KKK KKK KK** MODEL NAME 
FFL 


kK KKK KKKeK* MODEL STATES 
k1l«xSx -— k1x«*X 


d/dt(X) = 

d/dt(Y) = k2*X - k2«*yY 
d/dt(Z) = k3*Yeff*xX k3*Z 
X(0) = 0 

Y¥(0) = 0 

Z2(0) = 0 


kK KKK KKK * MODEL PARAMETERS 


Sx = 0 
kl = 1 
k2 = 1 
k3 = 1 
thrY = 0.5 


kkkKKK*K*K***e MODEL VARIABLES 
Yeff = piecewiseIQM(Y-thrY,Y>thryY,0) 


All motifs are possible considering networks with 

more than 3 nodes. It is imaginable that one element 
in a signaling branch might inhibit the first element of 
that branch in a negative feedback loop. This section 
builds upon the work of Uri Alon. His book is a classic 
in systems biology and highly recommended to help you 
understand regulatory network motifs [3]. 
We will revisit feedbacks again within this Chapter (Ex- 
ample 9 & Figures 23/24/25 and the respective text), 
once we have learned about characteristics of systems 
and useful mathematical approaches which will help us 
to formally analyze such motif structures. 


For further deepening of knowledge around this topic 
we also recommend: 
YouTube: Network motifs 


Summary: Mode of action of network motifs 
positive feedback = self-potentiating 

— (often) destabilizing 

— bifurcations 

— bistabilities 

— switching behavior 

— hysteresis 

= self-degrading 

— stabilizing 

— oscillating 

— regulating 


negative feedback 
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(c) Negative tryptophan feedback: Numbers indicate molar fluxes normal- 
ized to the glucose uptake flux (100). Each of the 3 numbers belong to another 
model variant. Source: [8]. Copyright © 2004, Elsevier Inc. 


Figure 10. Naturally occurring feedback in biology: (a) Positive feedback on the genetic level. (b) Positive feedback 
on the protein level.(c) Negative feedback on the metabolic level. 
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1.3 Hi What is a system? 
Next, we are moving into the formal description of such 
networks as systems. 


Definition 2. System: A system is a group of interde- 
pendent items which are interacting with each other in 
a way that forms an integrated whole. 


One can also say: a system is a set of interrelated ob- 
jects (elements, parts) which all have certain general 
properties [9]. 


1. The system fulfils a certain function, i.e. it can be 
defined by a system purpose recognizable by an 
observer. 


2. It has a characteristic constellation of (essential) 
system elements and an (essential) system struc- 
ture which determines its function, purpose, and 
identity. 


3. It loses its identity if its integrity is destroyed. A 
system is therefore not divisible, i.e. the system 
purpose can no longer be fulfilled if one or several 
(essential) elements are removed. 


system 


environment feedbacks f(x) 


system system 

elements x output y 
S 

system 3 

input u r~) 


disturbances z 


system structure f(x) | system boundary 


Figure 11. A system consists of system elements and the 
connections between them [9]. 


As a side note, the system structure might be (par- 
tially) hidden, i.e. we do not know what is underlying 
the observed system dynamic. We might refer to the sys- 
tem, then, as a black box—and we could try to decipher 
the system behavior by analysis of the relationship of 
inputs u and outputs y, as illustrated in Figure 12. 


—» black box y 


Figure 12. The black box concept. We know little about 
the system, but we can learn about its behavior from the 
input u to the output y. 


Let us look at some examples. A sand pile is not 
a system, as after the removal of a sub-part it is still a 
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sand pile. One grain of sand is also not a system, as it 
is only one element. A chair or a building is a system 
because the removal of one particular element destroys 
its integrity. These are static systems. Dynamic sys- 
tems show changes over time. We do not have to restrict 
ourselves to observable behaviors. Relevant state vari- 
ables are of functional importance but not necessarily 
observable. Furthermore, the non-linearity of biological 
systems (non-additivity) is prevalent. 


Linear: v[S;]+ v[S2] = v[S; + So] 
Non-linear: v[S;] + v[S2] 4 v[S + S2] 


nonlinear 


[S] 
As we have seen so far already, non-linear dynamics 
can lead to a variety of phenomena like: 
e oscillations 
e amplification 
e adaptation (see Figure 13) 


e switching, bistability, hysteresis (see Figures 1,3,4). 


input 


Oe IO) 


output wie 
Figure 13. Near-perfect adaptation to varying input levels 
obtained from negative feedback. Other examples and 
source: [10]. Copyright © 2016, Elsevier Inc. 


output (A) feedback (B) Input 


See also 
YouTube: The beauty of a chaotic double pendulum 


We will explore these important behaviors on the next 
pages, starting with the introduction of a mathematical 
approach for describing dynamics over time. 


1.4 § How to describe change mathematically 

The change of a variable is described by a derivative—a 
concept from calculus. The most widely used mathe- 
matical approach to capture such changes in models 
is done with Ordinary Differential Equations (ODE). 
Such ODEs have a dependent variable y which is a func- 
tion y(t) of 1 independent variable such as time t. The 
derivatives of the dependent variable are found with re- 
spect to the independent variable a If the independent 
variable is time, we can use the dot above the dependent 
variable wy = y to indicate derivatives, otherwise we use 
a prime symbol: y’. The dependent and independent 
variable appear in the explicit: 


y= f(t,y) 


or implicit equation form: 


f(t,y,y) =9. 


Important: In the context of models of biological sys- 
tems, often only 1st order ODEs are applied, as illus- 
trated in Example 2. We have therefore included an in- 
troduction to these cases only. ODEs might also contain 
higher derivatives, i.e. derivatives of derivatives. Fur- 
thermore, Partial Differential Equations (PDE) have 
a dependent variable y, which is a function y(z1, z2, ..., 
Zn) Of at least 2 independent variables z;, such as time 
(e.g.: z1 =t) and 3 room coordinates (e.¢.: 22,723, & Z4). 
While an ODE is frequently used for processes over time, 
a PDE is often used to describe processes over time and 
space. These advanced cases are not discussed here but 
are the subject of further reading if you are interested. 


A simple ODE model can be studied in Example 2 to- 
gether with Figure 14. 


Example 2: Simple ODE for biomass growth 


Another example of what x and its time derivative 
x might represent is biomass and its growth. 


x... population size 
x... growth rate 
w= 4 ... specific growth rate 


With the assumption = const, we can write the 
following ODE: 


X= Ux 
with initial condition x(t = 0) = xo and 
ft > 0: exponential growth 


Li <0: asymptotic decay 


Chapter 3: The magic of change and how to find it — 9/56 


visualized in Figure 14. The calculation of the time 
response x(t) is: 


aM |-dt:x 
1 
—dx = dt if 
x t 
-dx=p] dt 
xy ¥ 
[2n(x) hv = H(t — 0) 
In(x) — In(xo) = ut 
In(~-)=ut |e to both sides 
XO 
DE es gl 
X0 
x(t) = xe" 


t 
Figure 14. Population growth x depending on time and 
the specific growth rate 1. See Example 2. 


A set of non-linear ODEs of n-th order can be written 
as: 


dx; . : 
Gp Fits km Pls Past) jl=l...n 


and in vector notation as: 


dx, 
as f(x, p,t) 

with the independent variable t, the time-dependent 
states x = (x1,...,%,)’, mostly time-invariant parame- 
ters p = (p1,.--,Pn)', and the functions f = (fi,..., fn)’. 
Although the system states are time-dependent, they 
should still be linearly independent of each other. The 
order of an ODE equals the number states within the 
set of Ist order ODEs. 


Reactions depend on changing substrate concentrations 
In the last chapter, we considered constant substrate 
concentration in the steady state: 


S=N-v=0 


and we looked mainly at the fluxes v, which can also 
be dissected into forward vy and backward vy reaction 
fluxes. If the substrate concentrations are constant, it is 
possible to combine these with the constant parameters 
to a flux value for a given reaction. 
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With the study of the time behavior of each substrate’s 
concentrations in our system with: 


S=N-v 


we cannot do this anymore. We have to dissect the flux 
vectors into constant parameters k and now variable 
substrate concentrations S;. Therefore, we usually use 
the very important Law of Mass Action formulated by 
Waage! and Guldberg” in 1864 [11]. For each reaction 
j, we can define the concentrations of substrates S; and 
products P;: 


kj 
LemjaS ja = Ym ipPip 
a a, B 


with indexes a and f being subsets of the metabolite in- 
dexes i= 1,...,mand molecularities m jg,,m jg for reaction 
jand participating substances a, 8 € i. The molecularity 
is often 1 or 2. 

We can say that the reaction rate is proportional to the 
probability of collision of the reactants. The probability 
of collision, in turn, is proportional to the concentration 
of reactants to the power of molecularity (stoichiometric 
coefficient). The general mass action kinetics for chemi- 
cal reaction networks is: 


vi = vig — Vib = kes] Sia” — i Pig 
B 


with the product of sequence operator [] which indicates 
that the following elements are multiplied (same princi- 
ple as for the operator Y to sum up the following terms). 
The equilibrium constant: 


7 5B 
=> Kj — Kj — Tp F i6,e4 
Ay Ma Siete 


for reaction j can be obtained if the substances are in 
equilibrium v;,7 = v;,. In contrast to Chapter 2, here 
we do not have a dynamic equilibrium of metabolites’ 
production and consumption. 

For the biochemical reaction: 


k 
S, +S, == 2P 
k-1 


we get the net reaction rate: 
V=ve—Vp = ky Sy -S2—k_] . p? 


with S,, Sz and P denoting the respective concentrations 
and k, and k_, being the rate constants of the forward 
and backward reactions. 


1 Norwegian chemist Peter Waage (1833-1900). 
? Norwegian mathematician and chemist Cato Maximilian Guld- 
berg (1836—1902). 
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One can also derive the equations directly from the 
reaction schemes without using the stoichiometric ma- 
trix. This might be error-prone for larger biochemical 
systems and molecularities of 2 and above. For the given 
scheme, the ODE system is: 


S, = —k SS +k_,P? 
Sy = —ky 81S +k_,P* 
P = 2k,S\S> — 2k_\P* 


= - 2 
= 2 -(k1S,S2—k_-1P*) 
N v 


This modeling principle can equivalently by applied for 
protein-binding reactions where the parameters k repre- 
sent the affinity between proteins—for example, between 
ligand and receptor: 


ky 
L+R=—LR 
k_} 


with the net reaction rate (from left to right): 


V = Vbind — Vunbind = ki -L-R—k_1-LR 


We could, for example, use this equation to see how 
much ligand-receptor complex LR will be present for 
certain initial concentrations of ligand L and receptor R. 
High affinity with a large constant K, =k; /k_, shifts the 
equilibrium towards the complex, and low affinity shifts 
this equilibrium to the monomers. After we have mod- 
eled a reaction network, an equilibrium can be found 
for the whole system. Trying to derive such an equi- 
librium for many coupled reactions by intuition alone 
might be too challenging. Thus, we need mathematical 
models. The ODE modeling framework gives the highest 
quality of mechanistic insight among all tools in systems 
biology—but only if we have enough data to adjust this 
kind of model properly. 


1.5 Ml What is a steady state? 
A steady state is characterized by the absence of change 
in the variable values while time progresses. 


Definition 3. Steady state: In systems theory, a system 
or a process is in a steady state if the variables (called 
state variables) which define the behavior of the system 
or the process are not changing over time. 


This definition is referring to macro-scale changes over 
time. Back reactions equal forward reactions at the molec- 
ular scale in a closed system. In an open system, a dy- 
namic equilibrium occurs in which fluxes and reactions 
are constant. We have a steady state if all derivatives of 
all state variables x; become zero: 


Si =0 


xp = 4; 


for all i. Moreover, specifically for the 1st-order systems 
we consider in this chapter, we have a steady state if the 
(1st-order) derivatives of all state variables x; become 
zero: 


x; = 0 


for all i. 
Consider the following Examples 3, 4, and 5: 


Example 3: Balance equation in steady state 


dP. 
—=0=J-—P 
dt 
Steady state? 
6=J-P=0 
=>J=P 


Example 4: One steady state of a linear system 
The derivatives of all states shall be zero: 


x=y+2x-2=0 
y=y-x+2=0 
Subsequently, we mark all state values with a star 


* to indicate that these are steady-state values and 
re-order the equations above as follows: 


y* =2—2x* 
y* =;x*-2 


Eliminating y* by setting both of the right-hand 
sides to be equal delivers: 


2—2x* =x* —2 
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which gives the steady-state solution: 


a eo 
=e <a e 


Example 5: Steady states of a non-linear system 


We work on a non-linear system and will thereby 
often obtain multiple steady states, like in this ex- 
ample: 


X= (y+2x—2)x=0 
y= (y—x+2)y=0 


This set of equations is already in the factored form. 
1 of the 2 factors in each single equation must be 
zero so that the whole term is zero. We have 2 
equations with 2 factors each. Thus, we expect 4 
steady states, whereby a subset can be the same. 
To simplify our approach, we will look at different 
cases. 

Case 1 

Let’s say the first variable x* shall be zero: 


x} 9=0 
X= (y+2x—-2)° x*=0 
y=(y-x+2) y =0 
a ee 


3 yi 


Then Equation 1 is zero as well. The second equa- 
tion is then: 


0= (y* +2)" 


which results in two allocated y* values: 


yi=0, y,=-2 
with which we get our first 2 steady states (0,0) 
and (0,-2). 
Case 2 


We now take the other factor (y+ 2x — 2) in the first 
equation and set it to zero: 


! 
=0 > y34 
S 


—_~ 

x= (y+2x—2) x=0 

y= (y-x+2)y=0 
~~ -—— 


x19 
Therefore, y* has to be: 


y' =2(1-x"*) 
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with which the second equation becomes: 
0 = 2(1—x"*)(4—3x*) 
which results in: 


4 
B=1, y= 3 
and two allocated y* values with which we get our 
last 2 steady states (1,0) and ($,—3). 
Case 3 (redundant) 
Ansatz: y* =0 
with the solutions (1,0) and (0,0). 
Case 4 (redundant) 
Ansatz: y* =x* —2 
with the solutions (0,-2) and G, —4). 


1.6 @ Stability theory: Idle state or explosion 
Stability is a system-intrinsic property and is character- 
ized without external input u = 0. A non-linear system 
can have several steady states. The system’s steady state 
might be slightly deflected by 6x: 


Xo =x*+6x 


and the subsequent behavior indicates whether a system 
is stable, unstable, or metastable—as illustrated in Figure 
15. 


\e @ 


unstable stable metastable 
(system leaves (system returns (behavior is 
steady state) to steady state) indifferent) 


Figure 15. To get an initial intuition, imagine a ball on 
different surfaces. If you poke it a bit, what will happen? 


To test this, we linearize around the steady state of the ex- 
ample system x = mx and can see by the slope m whether 
the steady state is stable or not. Linearization is scruti- 
nized in the mathematics section. Ponder on Figure 16. 
The linearization: 


Ax = JAx 


with the Jacobian matrix J is explained in the mathemat- 
ics section. The formal approach employs the exponen- 
tial function starting from the system equation: 


x = mx 


with the ansatz x(t) = e” and its derivative x(t) = s- 
e”, and consequently we transform the system into the 
Laplace Domain. It results in: 


se’ =me’>s=m 


with s being the Eigenvalue of system x = mx. See also 
Figure 17. 


The characteristic polynomial and Eigenvalues for higher 
order systems 

In general, the characteristic polynomial allows us to 
determine the Eigenvalues of the system and thereof the 
stability of the steady states. 

An ordinary differential equation in the non-matrix rep- 
resentation 


4 
Y Gehan — bou 


is the base for the characteristic polynomial without con- 


sideration of the control (u = 0). We use the Laplace 
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m<0O 


X= mx 


stable 


unstable 


Figure 16. Assessing stability with an x over x plot. For 
the unstable steady state, if x is deviating in the positive 
direction from steady state, the resulting derivative will 
also be positive and thus make x even more positive— 
and vice versa for negative deviations. Contrarily, for 
the stable steady state the derivative (change) will be 
negative if x is positive and positive if x is negative. This 
will outbalance deviations and stabilize the steady state. 


x(t)~ S,m>0 x(t)}¢ s=m=0 x(t) sm<0 
eS! 00 et! +0 
t t t 
unstable meta stable stable 


Figure 17. Behavior over time for the unstable, metastable, 
and stable system of the form x = mx with different pa- 
rameter values m. 


ansatz: 
_ et 
x=se™ 
x s2 et 
Y= 9 et 
(4) 
Y= st et 


to transform our problem into the frequency domain 
and get: 


s’+...+as* +as+ay =0 
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Example 6: 2nd-order system 


X¥+ajx+aox =0 


ansatz: 
ae et 
x= se" 
R= sre" 
sre" tase" tage" =0 |: e%(40) 
r+ a, s+ a, =0 
Ww 
tr det 
—ay+, a —4apo 
> 512= —  —- — 


The linear x = Ax or linearized system x = Jx in the 

matrix form can also be treated with the same ansatz: 

x= Ie" 

x=sle" 
and the identity matrix J to get: 

sIe" = Ale“ = Ae™ 
(sI—A)e" =0 
(sf —A)x=0 


We only obtain non-trivial solutions (x 4 0) of the homo- 
geneous system if we look for: 


det(sJ — A) =0, 


which is another form of the characteristic polynomial. 
However, if the system matrix is a square matrix, and 
the rank equals the number of variables (rk(A) =n), we 
only obtain the trivial solutions (x = 0). 


Eigenvalues of the 2x2 matrix (fast equation) 

The characteristic polynomial of a 2nd-order system (n = 
2) is introduced in Example 7 and can be obtained by 
the following procedure: 


Xy = 441 X1 +.442X2 


X2 = az1X1 +.22X2 


=> 
1) (41 42) (M1 0 
X2 a21 422) \x2 
WHE eS 
x A x 
with: 
x= Ie" 
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we get: 

SX] = G11X] 1 412X2 

SX2 = 421X] + 422X2 
=> 

. X1\ (411 412 xX] 6 
X2 421 422 X2 
—_—eC—O eo 
x A x 


‘ 1 O\ fan an en ee 
01 ar, an x2) 
SNe 


|st—A| adhe fF | 7 a | 
0 a2, a22 


1 
_ {fs O} fan ai 
— Os a2, 422 


(s—a1)(s—a22) — aj2a21 =0 


2 
S” — Al S — A228 + a1 22 — aj2a21 = 0 


a 1122 — a\2a21 = 0 


detA 


s* —(ay; +an)s 
+ — 
tr(A) 
s’ —tr(A)s+detA =0 


tr(A) + \/tr2(A) — 4det(A) 


S12= . 
2 


1,82 can be real numbers but also complex numbers of 
the form s = Re(s) +iJm(s) with i= /—1. If the number is 
complex, the system has oscillating behavior. See Figure 
18 for different Eigenvalue pairings and the consequen- 
tial systems behavior. Additionally, we can also classify 
the systems according to the Final Value Theorem (FVT). 
Eigenvalues on the imaginary axis and the right half of 
the plane indicate that the system does not converge to a 
value and is thus not classified with the FVT. We classify 
the system type according to the number of Eigenval- 
ues in the origin. If the real part of the Eigenvalues is 
on the left-hand side, we have a Type 0 system. If one 
Eigenvalue is on the origin and the rest is in the left-half 
plane, we have a Type 1 system that is going to have a 
real finite number, as shown in Figure 17 middle. With 
each additional Eigenvalue at the origin, we increase the 
system type order and get unstable behavior. 

YouTube: Final Value Theorem (up to 8:17). 


Im(s) Im(s) 
| Re(s) | Re(s) 


Im(s) Im(s) 

| Re(s) | Re(s) 
Im(s) Im(s) 

| Re(s) | Re(s) 


Figure 18. Eigenvalues in the frequency domain are 
shown in the s-plane. Each Eigenvalue has a real and 
imaginary part: s=o+io withIm(s) =i, and Re(s) =o 


Example 7: Eigenvalues and stability 


a) 


We search for the Eigenvalues with: 


det(A — sI) =0 


(3—s)(5—s)+2=0 
s°—8s+17=0 


2 
—8 —8 
S12= a (=) —17 


=>512=4+i 


— unstable focus 
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b) 


Co) (2) 


— stable node 


H-6 DE) 


$1 =5352 =—-1 + unstable saddle 
d) 
Bal = —1 3 XxX] 
xo} \-1 1 x2 
> 512 =0+ V2i —>+ metastable limit cycle 


Stability classification 

We can classify the stability properties via the Eigenval- 
ues, as summarized in Figure 18. For 2x2 matrices, we 
can also use the trace and the determinant of the system 
matrix as shown in the Poincaré diagram in Figure 21: 


Fs tr(A) + \/tr2(A) — 4det(A) 


2 


with the distance between Eigenvalues: 


A=: ,/tr?(A) —4det(A) =0 


det(A) = ra) 


to get the threshold above which we gain imaginary 
numbers. 


4,42 < 0 then det(A) > 0,tr(A) <0 — stable node 


A1,A2 > 0 then det(A) > 0,tr(A) > 0 — unstable 
node 


A, <0,A2 > 0 then det(A) < 0 —> saddle node with 
one stable and one unstable direction 


Re(A,) = Re(Az) < 0 with no vanishing imaginary 

part. Then det(A) > 0, tr(A) < 0,tr?(A) <4det(A) stable 
spiral 

Re(A,) = Re(Az) > 0 with no vanishing imaginary 

part. then det(A) > 0,tr(A) > 0,tr?(A) <4det(A) > unstable 
spiral 


Slopefield 

A slopefield describes the slope at each point in the phase 
plot and thereby is an informative visual representation 
of the system dynamics. 

YouTube: Slope fields 
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overdamped 


critically damped 
-— == 


amplitude 


underdamped 


time 
Figure 19. An ODE system can show various behavior de- 
pending on the damping ratio. The higher the damping 
ratio is, the more the oscillatory behaviour is reduced. 
See Example 8. 


Example 8: Damping 


For a deeper understanding of the origin of real 
and imaginary Eigenvalues and the respective sys- 
tem behavior, it is beneficial to look at damping, as 
shown in Figure 19. Damping reduces the impact 
of the oscillatory behavior. In physical systems it 
is achieved by the dissipation of energy stored in 
the oscillatory system. Let’s assume a system has 
the following characteristic equation: 


s4+2s+6=0. 


The quadratic equation can be solved with the p-q- 
equation: 


sy2=-lt 1-¢ 


We see that the first term indicates negative Eigen- 
values at the first glimpse. This remains the case 
as long 0 < ¢ < 1. If we have € < 0 we have one 
positive Eigenvalue and thus an unstable saddle. If 
¢ > 1, we obtain a negative value below the square 
root and the term turns into the imaginary part 
+ai of the complex number s. Consequently, we 
have oscillatory behaviour. Thus ¢ has a damping 
function with: 


0<¢<1 overdamped 
¢ =1 critically damped 
¢ >1 underdamped 


The smooth transition within the p-q equation and 
the symbol + explains why the imaginary parts of 
the complex Eigenvalues arise pairwise with the 
same real part in Figure 18. This only applies to 
simple systems with a 2x2 system matrix A. 
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1.7 @ Phase portrait: How elements relate to each other 
Let’s assume we have a stable system with two elements 
x; and x2. 


We do not only want to know how the elements change 
over time. We also want to know how the elements relate 
to each other: 


dxo(t) _ fa(xi(t),x2(¢)) 
dx\(t) — fi(xi(t),x2(t)) 


by making use of the relationship: 


X2 (t) an dx2 dt = dx2 


x1 (t) a dt dx dx, 


Put in simple terms, the time derivative has been can- 
celled out and we have no time axis anymore, as illus- 
trated in Figure 20. But time is still present, because 
at each time point and for each state magnitude x2 has 
an allocated state magnitude x;. For the whole time, it 
could be that the system has passed different magni- 
tudes of x2 at a particular magnitude of x. Moreover, 
there might be several different values for x; at the mag- 
nitude x2. But let us go back to the start. You can now 
choose which initial values you would like to start with 
[x(t = 0),x2(t = 0)]. From this point on, the system will 
move in the phase plot until it reaches a stable state 
where it will remain forever [x(t — °),x2(t — 0)]. The 
path which the system takes until it arrives a stable point 
is called a trajectory. 


Figure 20. Phase portrait. The axes represent the mag- 
nitude of the elements x),x2. The trajectories start from 
[x1 (t = 0),x2(t = 0)] and move over time to a stable steady 
state [x1 (t — °),x2(t — o)]. In this way, we can plot our 
system as the change of element x2 over the change of 
the element x;. One could also decide to show the rela- 
tionship the other way around, e.g., the phase plot as the 
change of x; over the change of x2. 


Definition 4. Trajectory: Set of all state points (x),x2) in 
the state plane starting from fg or t = 0. 
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Interestingly, not all initial points might lead to the 

same steady state. Depending on your initial point, the 
trajectories might be shaped differently and might land 
at different steady states, given that you have multiple 
stable points. Thus, it is interesting to look at several 
trajectories which start at different initial points. 
If we want to draw a phase plot by hand, we would 
greatly benefit from orientation lines. Therefore, we 
determine asymptotes with the most simple cases as 
being: 

e Vertical asymptotes (vertical tangents) 


e Horizontal asymptotes (horizontal tangents) 


e Oblique asymptotes. 
What horizontal and vertical means depends on which 
element you want to study in dependence on the other. 
We remain in the general equations shown above (see 
also Figure 20). We obtain the horizontal and vertical 
tangents with the following ansatz: 


e Horizontal tangent ("no change in the x2-direction"): 


f2(1,22) =0 (1.1) 


e Vertical tangent ("no change in the x-direction"): 


fi(m1,»2) =0 (1.2) 


For the oblique asymptote, we introduce the general 
linear equation into our phase plot equation: 


d (x2 = mx)) fa(x1, (x2 = mx;)) 
dx| fi (x1, (x2 = mx1))’ 


The obtained values for m are set back in the linear equa- 
tion x2 = mx, and represent the oblique asymptotes. For 
a deeper discussion of this topic, please refer to the exer- 
cises at the end of this chapter. 


Definition 5. Nullclines: Curves on which the trajecto- 
ries have the derivative 0. 
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det(A) 


x 
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tr(A) 


saddle 
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Figure 21. Poincaré diagram for 2x2 matrices: Classification of phase portraits in the (det(A),tr(A)) plane. Credit to 
Gernot Salzer, No rights reserved. 


1.8 Hi Feedback revisited 

In the next step we will use the introduced mathematical 
concepts and analyze the systemic properties of some 
network motifs discussed earlier. We will thereby see 
how and when these motifs give rise to characteristic 
non-linear dynamic phenomena. 


Sequential 
k k. kz 
i , A ay B t3 
Positive feedback 
k,>0 
ky kp k3 
—A—>B — 


Negative feedback 
kg <0 


k 5 ke 
ul A I 2 B v3 
Figure 22. Network structures of the feedback examples 
as analyzed in Example 9. 


Example 9: Feedback 


In this example we will perform a detailed, mathe- 
matical analysis of the effects of negative and posi- 
tive feedback in a simple metabolic pathway (Fig- 
ure 22). We start with the general stoichiometric 
matrix: 


Vy v2 v3 V4 
N=]1 -1 0O IA 
Oo 1 -1 O|B 


We have 2 equations (m= 2) and 4 reactions (r= 4). 
The rank is 2 (rk(N) = 2). This means, we can set 
2 unknowns k to arbitrary numbers to find 1 of 
the many possible solutions. We decide to take 
the influx k, as control u and to set k4 to different 
values: 


ka <0: negative feedback 
kg =0: sequential (no feedback) 
k4 >0: positive feedback 


We use mass action kinetics to set up an ODE sys- 
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tem for the concentrations of A and B: 


. u 
A\ _/1 —-!l 0 1 koA 
B}) \O 1 -1 0 k3B 
~~ ONO ka B 
Ss N 


—ky ka \ (A 1 
—————$<—<—_ ~~ en 
x A x u 


which is the ODE system: 


A=—hA+khB+u 
B= kA—k3B 


Let’s interpret these equations: We have an influx 
u, which positively influences the change of A. A 
part of the substance A turns into B with reaction 
rate ky. If term k2A is subtracted from the first equa- 
tion, it negatively influences the change of A. The 
same term appears in the equation for the change 
of B. This ensures that no mass is lost. Thus, the 
term k2A positively influences the change of B as it 
negatively influences the change of A, which makes 
sense because we want to turn substance A into sub- 
stance B. The substance B leaves the system with 
rate kz. One can see that the loss of substance of B 
increases with the abundance of B. The feedback 
via k4B influences A depending on the abundance 
of B. This interaction is modeled as signaling flow. 
B "only" transmits a signal but is not converted. 
This means that it does not preserve the mass con- 
servation at this point and it does not reduce B by 
kB. 

This is a fundamental and important difference: 


e Preserve mass conservation — substrate flow 


e Not preserve mass conservation — informa- 
tion flow. 


We will now look at the stability of system matrix 
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A via the Eigenvalues: 


—ko k4 = s 0 
ko —kz 0 s 
(ko +s) (ks +s) —kok, =0 
S° + (ky +k3)s + ko (kg — ka) =0 


We study the case kz = k3 = 1: 
s?+25+(1—ky) =0 
The related p-q equation is: 
sg =—-1t V1-(1—-&) =-14 Vig 


Sequential setting ky = 0: 
Both Eigenvalues are negative 


=> =—-lm2=-1 stable (Type 0) 
Positive feedback (ky = 2): 

=> S$, = 0.4142; S, =—2.41 unstable saddle 
Positive feedback (ky = 1): 

=> S$, =0; S. =-—2 metastable (Type 
1) 


Negative feedback (k4 = —1): 
=>S,2=-1Hti 
Damped oscillations, stable (Type 0) 


The following video show the simulation of the 
dynamic behavior of this pathway with different 
strengths of negative or positive feedback: 
YouTube: Example simulated 


1.9 @ Bifurcation: Split it baby! 

Adding to Example 9, we will now tackle the mathemat- 

ical analysis of another example model of feedback. We 

will, in addition, analyze how the obtained steady states 

and dynamic properties depend on the value of a system 

parameter. Simple feedback can also be modeled as: 
X= —kneg -X kneg > 0, x(t =0) = 1 

which turns out to be a stable steady state, as shown in 

Figure 16. 

The time-course is: 


x(t) =xq-e ‘nes 


as shown in Figure 23 left. Positive feedback, however, 
might destabilize a system. Let’s have a look at combined 
negative and positive feedback 

X= —kneg-X+Kpos +x kneg, kpos > 0; x(0) = 1 
whereby the variable x is the cause and the change x is 
the consequence within the feedback loop. The related 
solution of this differential equation is: 


x(t) = xp ettos~hneg) 


As you see in Figure 23 right below, the system becomes 
unstable if the parameter exceeds certain values. How- 
ever, healthy biological systems are never globally un- 
stable, because they run into saturation. Malignancies 
like cancer and virus production, however, can be seen 
as biological systems that might lead to global system 
failure, if outbalancing feedback (medicament, immune 
system) are too weak to control the disease. 


a5 kneg = 3 


x(t) =Xo° eKpos—kneg)t 


Kyos = 9 
) pos 
~e) x(t) =X° e~Kneg't 


neg 3 Kpos 
Figure 23. Feedback: On the left-hand side the behavior 
of simple negative feedback is shown. On the right-hand 
side the behavior of a system with fixed negative feed- 
back kneg = 3 is shown with different strengths of positive 
feedback. 
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f(x) 
6- 


1 
0.5 k 
pos 
Figure 24. Feedback with saturation: The Michaelis- 
Menten equation stabilizes the system, so that we get a 
stable state for each k,.5. Compare with Figure 23. 


1 4 


Kos 

Figure 25. Feedback with complex behavior: Here we 
get several stable states, but also hysteresis. Compare 
with Figure 23 and 24. 


We can use the Michaelis-Menten function to mimic a 
saturation for the positive feedback: 


Kyos *X 
; pos 
X= —Kneg X44 
Kpos *X+1 
Vmax 
6 -x 
= —Kneg > x4 
‘4 T 
x+ Ukpos 
SY 
Kn 


The solution for the steady state(s) is: 


—knee : Kpos Cae _ Kneg 4 6K pos x =0 
. (—Rnge Kpos i Kneg 7 6k pos) =0 
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Steady states: 
Case 1) 
Case 2) — Knegkposx* + 6Kpos — Kneg = 0 
6K nos — Knee 
= xs a pos neg 
Kpos . kneg 
6kyos —3 2K yoy — 1 1 
For knee =3:2 = epee a = 
i 2 =< Bias Kings Rie 


Stability: Linearization 


Kpos*X 
Kos *X+1 
(Kpoet = Rigor Knot) 
(Kposx* — 1)? 


X= —kneg*X+6- 


Ax 


Koos 
=> At = —kneg- Ax +6 


Which roughly gives the steady state x} = 0 
At = —KnegAx ae OK pos Ax = (6k pos ot Kneg Ax 
For Kneg = 3: 


Ax = (6K pos —3)Ax 
<0 for kpos < 1/2 
>0 for Kpos > 1/2 


Stability of this steady state? 


S = OKpos — 3 
x; unstable for kpos > 1/2 
stable for kpos < 1/2 


For an illustration of how the steady state(s) and their 
stability depends on parameter value kp,s (bifurcation 
plot), see Figure 24. The system is not simply unstable 
beyond a parameter threshold. It changes to the closest 
stable steady state. 

When combining negative feedback and positive feed- 
back with saturation and some basal activation, a bistable 
system can be obtained (Figure 25). Here, which steady 
state is reached depends on the parameter value kyo as 
well as on the initial conditions: 


(hoor? 
t= 2 —Kyegpx +6- ——— 
x 0 negX + (hee? +1 
a 


f; pos (x) 


basal level 


Summary of key points here: 
e Two stable steady states > bistability 


e Increase of steady state with kpo; not continuous 
anymore, but step-like. (See next section on bifur- 
cation for more details.) 
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Neg. feedback — stabilizing 

— oscillating (not shown here) 
— (often) destabilizing 

— bifurcations 

— bistability, switching behavior 


— hysteresis 


Pos. feedback 


e Contains no information about how long the sys- 
tem needs to reach steady states 


e Hysteresis: influence of history. Consequence per- 
sists after cause vanishes. 


The plot at the bottom of Figure 23 shows the change 
of a system’s steady state depending on parameter value. 
In the bottom right part of the figure, we see that the 
system suddenly changes to an unstable steady state, 
although the previous behavior was approaching an 
equilibrium. This unstable steady state can lead to differ- 
ent directions—the trajectories split up. The trajectories 
could split up several times until we, with certain param- 
eter sets, reach chaotic system behavior (not shown). 


Definition 6. Bifurcation: Qualitative change of system 
behavior with parameter displacement. 


This is relevant if: 


e parameter unknown (welcome to Systems Biol- 


ogy) 


e parameter is influenceable (drug target?) 


A double negative feedback loop, the toggle switch, is 
acommon motif in differentiation biology, as shown in 
Figure 26. Waddington’s” epigenetic landscape is a visu- 
alization of the bifurcation valleys and laid a cornerstone 
of systems biology, epigenetics, and developmental bi- 
ology. A change of parameters changes the landscape 
shape. Look back also to Figures 3 and 4 where a dotted 
line marks the unstable regions. The individual trajec- 
tories might go to the adjacent stable stem cell states. If 
you are as excited and curious as we are, read this paper 
about bistability, bifurcations, and Waddington’s epige- 
netic landscape [12]. 


Mathematically, a local bifurcation (xo, 49) appears 
to be a continuous dynamic system: 


x= f(x,A) 


is linearized around a certain fixed point, and the ob- 
tained Jacobian reveals an Eigenvalue with a zero real 


3 British biologist and philosopher Conrad Hal Waddington (1905— 
1975). 


state space A 
SMALL CIRCUIT network states with 
(2-gene genome) = gene expression profiles epigenetic landscape 
\are stable 
Archtecture @ 2 en ee ae 
("wiring diagram”) 5 
———————— = = 
| x) ] 
® © 4 : 
| x g a 
| \ 5 2 > g 3 
expression level gS @y |{i ij} 
<_ €2@ 
high low B 
B 
COMPLEX NETWORK _(N gene genome) 
each network state is 
a point in N-dimensional state space. 
State space 
(projected to a 2D plane) 
with epigeneti : 
Epigenetic barrier Pig natic landscape 
(can be lowered developmental 
by mutation) trajectories 
Cancer attractor 


(preexisting or 
generated by 
mutation, 
ormally unused) 


“Potential” 
=elevation 
reflecting 
stability 

of a state 


each point is a 
network state: Le 
Gd 
unstable 
= transient state 
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fh Eh 


stable 
= another mature cell type 


stable 
= mature cell type 


Figure 26. Bifurcation example: (A) depending on the initial values of the protein level states in the toggle switch 
motif, we can have a stable or unstable situation. An unstable state can fall into the one or the other direction. (B) 
Developmental trajectories follow the imaginary epigenetic landscape of Conrad Waddington. Adapted from [13]. 


Copyright © 2009, Elsevier Ltd. 
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part. If you get stuck here, do not worry. Come back 
later after you have worked through the rest of the doc- 
ument. The Poincaré-~Andronov—Hopf bifurcation,*°° 
or (in short) Hopf bifurcation, is an interesting case 
because, beyond this point, the solution becomes os- 
cillatory (complex Eigenvalues). Biological examples 
are the Lotka—Volterra model for the interplay of preda- 
tors and prey, the Hodgkin— Huxley model of neuro- 
electrophysiology, and the Selkov model of glycolysis 
[14]. 

Also check out the time profiles and bifurcations as oc- 
curring in a cancer-signaling network (Figure 27). 


* Austrian-Hungarian mathematician and astronomer Eberhard 
Frederich Ferdinand Hopf (1902—1983). 

5 French mathematician, theoretical physicist, engineer, and philoso- 
pher Henri Poincaré (1854—1912). 

6 Soviet physicist Aleksandr Aleksandrovich Andronov (1901— 
1952). 
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Bistability ay ie 


Switch-like 
a A A 


Time 
DNA damage 
response 


Growth factors 
(e.g.EGF, VEGF) 


Proliferation zB 
Angiogenesis : Cell cycle 
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Figure 27. Time profiles in cancer: "Intracellular signaling regulating cancer associated phenotypes and dynamic 
properties of key signaling proteins. The main signaling cascades regulating cancer hallmarks focused on in this 
review are depicted. Key signaling proteins in each signaling pathway have various dynamic properties. Cell cycle 
drivers such as CycD/CDK4 and CycE/CDK2 show oscillatory dynamics, whereas Rb, which is involved in entry for 
S phase, shows switch-like response. Caspase 3, which mediates programmed cell death (also known as apoptosis), 
also shows switch-like activation. The tumor suppressor gene p53 displays various dynamics, e.g., oscillation, in 
response to DNA damage [...]. Racl and RhoA, which are involved in epithelial-mesenchymal transition (EMT) and 
cell motility show bistability. Akt and ERK, which play a key role in multiple cellular processes such as proliferation, 
differentiation and cell death display sustained and transient dynamics, respectively". Direct quote and source: [15]. 
Copyright © 2017, Elsevier Ltd. 
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1.10 & Simulation-based analysis of motifs 

As mentioned earlier in Example 1 to 6, computer simu- 
lation is the method of choice to analyze large or com- 
plex dynamical systems. For such systems, analytical 
solutions cannot usually be obtained anymore. For the 
simulation within a computational framework, the set 
of balance equations (ODEs) has to be supplied in a 
specific form. These equations will then be numerically 
integrated and the time-course behavior for a given set of 
initial conditions and parameters will be approximated. 
By repeatedly simulating a system with different initial 
conditions and parameters, an overview on the overall 
system behavior can be obtained. We will not introduce 
more advanced methods like Parameter Identification or 
Sensitivity Analysis here (further reading e.g. [16]), but 
will illustrate the basic simulation approach with two 
examples: 


k1p k1d 
At \ 
me \ k2dp 
\ 


l 
V+ vt 
k2p k2d 
Figure 28. Network structure of the Sniffer motif. Stimu- 
lus S activates the production of proteins P and R which 
both also get degraded. The degradation of P also de- 


pends on P. The respective parameter names are given 
in italics. 


Example 10: Change detection & adaptation 


The Sniffer motif as depicted in Figure 28 is a small 
motif consisting of two proteins and an activat- 
ing input stimulus [17]. The respective balance 
equations are incorporated in the computer model 
which is given below in the IQM toolbox format 
[5]. Other tools use similar but slightly different 
formats. This model allows to simulate the system 
behavior, i.e. the behavior of the output as a func- 
tion of the input (Figure 29). From this we can see 
that the response shows perfect adaptation to vary- 
ing inputs, signaling that there is change but then 
resettling at the same steady state and being ready 
to newly respond to another change. Understand- 
ing this function of the motif is on a different level 
compared to just drawing the network interactions. 
For further reading we recommend the full paper 
that this extract comes from [17]. 
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kKKKKKKKK** MODEL NAME 
Sniffer 


kKkKKKKKKK*K* MODEL STATES 
d/dt(P) = klp*S - kldx«P 
d/dt(R) = k2p*S - k2d*R - k2dp*R«P 


kKkKkKKKKKKK* MODEL PARAMETERS 
S=0 
klp = 
kld = 
k2p = 
k2d = 
k2dp = 1 


OrFPRER 


50 100 
time 
Figure 29. Dynamics of the Sniffer motif. The time 
course of output R is depicted (right) as a consequence 
of the varying input S (left). 


time 


Example 11: Toggle switch 


The toggle switch motif is depicted in Figure 26 
and consists of two proteins which are mutually re- 
pressing each other’s activity, resulting in a double- 
negative feedback loop. This motif was nicely an- 
alyzed by Huang et al. [18] and we recommend 
reading the paper. One of the variants of the motifs 
is modeled with the following ODEs: 


xt thb} k 

xX, = ay: ———_ + hb} .———__—- ky -x 

PO hat x O' nbe e 
xR thb 

eT, eras” aR ey ee a” - 

2 Ong pa the 


With the MATLAB code below, using this model 
and analyzing (with repeated simulations) the ef- 
fect of the degradation parameter value k = k1 = k2 
on the number and exact value of the steady states, 
one obtains the bifurcation plot as shown in Fig- 
ure 30. The system is bistable for parameter values 
k < 1.8. This results in a phase plot with two stable 
steady states and one unstable steady state (Figure 
31). The slopefield indicates the regions of attrac- 
tion towards one or the other stable steady state. 
For parameter values k > 1.8 the bistability disap- 
pears and the system has only one stable steady 
state (Figure 32). The respective phase plot shows 
only that steady state with all the slopefields point- 
ing towards it. This motif plays an important role 
in many developmental processes (see Figure 26 
and [18]). 


bifurcation plot: x2... = f(k) with k=k1=k2 


blue = stable 
1.85 red = unstable 


X2 
3S 


Figure 30. Toggle switch motif. Bifurcation plot for the 
toggle switch motif (Example 11). The dependency of 
the steady state(s) for X2 as a function of parameter 
k =k1 =k2 is depicted. Stable steady states are plotted 
in blue and unstable in red. 
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phase portrait: X2(t) = f(X1t)) 
blue,red: stable & unstable steady states, dotted lines: horizontal & vertical tangents 
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Figure 31. Toggle switch motif. Phase plot for the tog- 
gle switch motif (Example 11) with slopefield. For pa- 
rameter values k = kl = k2 = | the system is showing a 
bistability. Stable steady states are plotted in blue and 
unstable in red. 


phase portrait: X2(t) = f(X1(t)) 
blue.red: stable & unstable steady states. dotted lines: horizontal & vertical tangents 


THEA 


X2(t) 
pre 


O5- eres 


x1(t) 


Figure 32. Toggle switch motif. Phase plot for the toggle 
switch motif (Example 11) with slopefield. For parame- 
ter values k = kl = k2 = 2 the system is having only one 
(stable) steady state (indicated in blue). 


au FON 


%% Toggle switch motif 

SThomas Sauter, University of Luxembourg 

clear all 

syms Xl X2 

syms al a2 thal tha2 n bl b2 thbl thb2 kl k2 

eqx1_gen=al*X1*n/(thal*n+X1*n) + ... 
bl*xthbl*n/ (thb1*n+X2*n) - k1*X1 

eqx2_gen=a2*X2*n/(tha2*n+X2*n) + ... 
b2*thb2*n/ (thb2*n+X1*n) - k2*X2 

% parameters for model A 

n=4 

k1=1 %1 %2 

k2=1 $1 %2 

thal=0.5 

tha2=0.5 

thb1=0.5 
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thb2=0.5 
al=0 %2de: 1 
a2=0 %2de: 1 
bl=1 %4a: 0.5 
b2=1 S4a: 0.5 
eqxl=eval (eqxl_gen) 
eqx2=eval (eqX2_gen) 
pretty (eqx1) 
pretty (eqx2) 
3% step 3 
[Xl_ss_sym, X2_ss_sym]=vpasolve (eqX1,eqx2,X1,X2); 
Xl_ss_tmp=Xl_ss_sym; %eval (Xl_ss_sym) 
X2_ss_tmp=X2_ss_sym; %eval(X2_ss_sym) 
Xl_ss=[]; X2_ss=[]; 
for k=l1:numel(Xl_ss_tmp) %sremove complex roots 
if (imag (Xl_ss_tmp(k))==0) && 
(imag (X2_ss_tmp (k) ) ==0) 
Xl_ss=[Xl_ss; Xl_ss_tmp(k)]; 
X2_ss=[X2_ss; X2_ss_tmp(k)]; 
end 


%% step 4 using a for loop 

syms Xl X2 

Sgeneral linearization 

Lin = jacobian([eqx1l; 

Saround steady states: 

for i=l:size(X1l_ss) 
X1=X1_ss (i) 
X2=X2_ss (i) 
eval([['Lin' num2str(i) 
eval([['eig' num2str (i) 

eig (eval (Lin))']) 


eqx2], [Xl X2]) 
using for loop 


] '= eval (Lin);"']) 
jl Ps 


end 

$% ht and vt tangents 

syms Xl X2 

[Xl_ht, X2_ht,params_ht,conditions_ht] 

=solve (eqX2,X1,X2, 'MaxDegree',5,... 

"ReturnConditions',true) 

[Xl_vt,X2_vt,params_vt, conditions_vt] 

=solve (eqX1,X1,X2, 'MaxDegree',5,... 

"ReturnConditions',true) 

%% plotting the phase portrait + numerical 
simulation of vector field 

figure; hold on 

axis([-0.1 1.5 -0.1 1.5]) %Plot range 

X1l_range=linspace(0,1.5,50); %sEvaluate all 
results in the same range 

X2_range=linspace(0,1.5,50); 

% plot steady states 

for k=1:numel (X1_ss) 


ss_plot = plot (Xl_ss(k),X2_ss(k),'«',... 
"MarkerSize',20, 'LineWidth', 4); 
eig_check=['eig' num2str(k)]; scheck 
stability 
if max (real (eval (eig_check) ))>0 
set (ss_plot, 'Color','r') %red for 
unstable (0) 
else 
set (ss_plot, 'Color','b') %blue for 
stable (<0) 


end 
end 
% plot vertical and horizontal tangents 
for i=l:size(Xl_vt) 
for k=1:numel (X2_range) 
Z=X2_range(k); 
for 1=1:numel (X2_range) 
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if (imag(eval (X1l_vt (i)))==0) && 
(imag (eval (X2_vt (i) ) )==0) 
plot (eval (Xl_vt (i)),eval(X2_vt... 
(i)),'r.', 'MarkerSize',10) 
% end 
end 
end 
end 
for i=l:size(X1_ht) 
for k=1:numel (X2_range) 
Z=X2_range(k); 
% for 1l=1:numel (X2_range) 
% X2=X2_range (1); 
if (imag(eval (X1_ht(i)))==0) && 
(imag (eval (X2_ht (i) ) )==0) 
plot (eval (X1l_ht (i)),eval (X2_ht(i)). J. 
,'g.', 'MarkerSize',10) 
end 
end 
end 
% numerical computation of example 
trajectories 
X1l_ic=[0:0.1:1.5]' 
the x-axis (X1) 
¥2_ic=[0:0.1:1.5]' 


S$initial conditions on 


S$initial conditions on 


the y-axis (X2) 
syms Xl X2 
eqX1=subs (eqX1,X1,str2sym('x(1)')); 
eqX1=subs (eqX1,X2,str2sym('x(2)')) 
eqX2=subs (eqX2,X1,str2sym('x(1)')); 
eqX2=subs (eqX2, X2, str2sym('x(2)')) 


eval (['dxdt=@(t,x,tmp) ([' 
char (eqX2) '])']) 
t_int=[0 .1] %vectorfield tend=0.1 
trajectories tend=1 
options=[]; * =tmp 
for i=l:size(Xl_ic) 
for k=1:size(X2_ic) 
{t, x] = ‘ 
ode45 (dxdt, t_int, [KXl_ic(i); 
X2_ic(k)],options) ; 
plot (x(:,1),x(:,2),'k') 
plot (Xl_ic(i),X2_ic(k),'k.',... 
"MarkerSize',5) 
end 


char(eqxl) ';' 


sfull 


end 

hold off 

title({'phase portrait: 
£€X1(t))", “blue, red: 

stable & unstable steady states, 
lines: 

horizontal & vertical tangents"'}) 

xlabel('X1(t)') 

ylabel ('X2(t)') 


x2(t} = 


dotted 


X2=Z; 


1.11 @ Additional reading: Cybernetics—The art of 
creating equilibrium in a world of constraints and 
possibilities 

Cybernetics or control engineering enables us to under- 

stand systems and to design the optimal control. The 

optimal control brings and keeps the system in a favor- 
able state. We can assume that the evolutionary pressure 
forced living systems to develop elegant control systems. 

Let’s start with the general state-space representation 

with a multi-input multi-output (MIMO) system with 
the block diagram in Figure 33. We are not only inter- 
ested in the actual system x = Ax itself, but also in how 
to control and how to observe our system. Often we do 
not know the real state values of our system. We have to 
measure the states and get an observed output y, such 
as the pERK level change in a cell culture experiment. 
Measurements, e.g. with western blot, can bias the real 
data of an untouched system. The observation matrix C 
describes which real states are pooled to our output and 
how the experimental analysis modifies the information 
of the true state values. Besides the analysis, we also 
know that it is very hard to precisely perform an experi- 
ment e.g. in cell culture. The pipette might have not been 
calibrated for a while and who knows what the clumsy 
intern from the computational department did with it. 
Thus, all the bias related to the control u is accounted 
for in the input matrix B. Additionally, not everything 
that we put into the cell culture (fetal calf serum, DMSO, 
stains) influences our system x = Ax, but might directly 
influence the observed output. Such bias is considered 
with the feedthrough matrix D. After we have explained 
the principle ideas behind the state-space representation, 
we can look at the equations for a simpler single-input 
single-output (SISO) system of Order Two: 


linear system non-linear system 


x Ax bu 


x f(x, 
Nooo  _:;F,O SSF oon 
X1|=layix1 +aj2x2|+}biu x1 |=] fi (01,x2,u) 
X2|=|a21x1 +422x2|+]} bou X2 |=] fo(x1,x2,u) 
——S =————S 
Tx g(x,u) 
x =Ax-+bu x =f(x,u) 
y =ce'x+du y =8(x,u) 


with the matrices and vectors: 
Ax & an) b= oy — (*) 
az, a22 by c2 
x(-) is called the "state vector", x(t) € R"; 


y(-) is called the "output vector", y(t) € R17; 


u(-) is called the "input (or control) vector", u(t) € R’; 
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A(-) is the "state (or system) matrix", dim[A(-)] =n xn, 
B(-) is the "input matrix", dim[B(-)] =n x p, 
C(-) is the "output matrix", dim[C(-)] =q xn, 


D(.) is the "feedthrough (or feedforward) matrix" (in 
cases where the system model does not have a di- 
rect feedthrough, D(-) is the zero matrix), dim[D(-)] = 


qx P, 


and the vector function: 
-_ fi 1 (x U ) 
ele Ge u) 
YouTube: State-space representation (all relevant) 


Systems often show oscillations and subsiding behav- 
iors over time as shown in Figure 19. The calculation 
is very difficult as we have to deal with complex inte- 
gral convolutions. As discussed in linear algebra, we 
might facilitate our problem by changing the coordinate 
system. The transfer to a frequency-based coordinate 
system is indeed a very good idea and has become the 
main tool of cybernetics. The transfer from the time do- 
main to the frequency domain allows us to treat control 
problems, for the most part, with simple algebraic oper- 
ations. The Laplace transform’ splits our function into 
harmonic sinusoids and exponential decay terms, and is 
an extension of the Fourier transform,® which dissects 
periodic functions into harmonic sinusoids only, as ani- 
mated in Figure 34. The sinusoid can either occur with 
different phase shifts or in combination with the cosines. 
The Laplace transform transfers our mathematical prob- 
lem from the time-dependent space into the frequency- 
dependent space (Figure 35) with the complex variable 
s = 0+ iq, as shown in Figure 36. The easily obtained so- 
lution in the frequency domain can be transformed back 
into the time domain by the inverse Laplace transform, 
as shown in Figure 35. The discrete time equivalent of 
the Laplace transform is the Z-transform. 


Some other important concepts in control theory are: 


Definition 7. Controllability describes the ability of an 
external input u to shift the internal state of a system 
from any initial state xo to any other final state x in a 
finite time interval. 


Definition 8. Observability is a measure of how well in- 
ternal states of a system can be inferred from knowledge 
of its external outputs. 


7 French astronomer and mathematician Pierre-Simon Laplace 
(1749—1827). 

8 French mathematician and physicist Jean-Baptiste Joseph Fourier 
(1768—1830). 
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Every input, which influences the output without influencing the system 


pERK 


Result | ** 
y 


Pipetting and 
experimental 
error modify 
control 


Measurement 


cell culture error or pooling 
of state values 


Figure 33. Block diagram of a multi-input multi-output (MIMO) system with associations to experimental cell 
biology. The operation { is an integral in the frequency space (Laplace transform) that turns the state derivatives 
into state values. See text for explanation. 


Definition 9. Transfer function: A transfer function is 
a mathematical function which gives for each possible 
input value the corresponding output value. 


Two types of principal design can be distinguished: open 
loop and a closed loop. An open loop design requires 
a complete understanding of all physics involved. An 
open loop system approaches the desired state after a 
certain time, but might not exactly reach it due to dis- 
turbances. A closed system compares the system output 
with a set point and permanently tries to reduce the 
error. A closed system reaches the set point despite dis- 
turbances and can reach the desired state faster with 
oscillations around the target. A good control system 
brings our system into the desired state in a short time 
with high precision despite possible disturbances. Watch 
the first five videos on control theory here: 

YouTube: Control theory lectures (videos 1-5). 
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Time domain 


x(t) —]| h(t) |—=> y(t) = A(t) * a(t) 
| 7 
Lael abl nverse 
aplace aplace ieaaee 


| | | 
X (s)—> | A) |— Y(s) = H(s)- X(s) 


Frequency domain 


Figure 35. Time and frequency domain. In- 
put x(t), impulse-response h(t), and output 
y(t) in the time domain, and their analogues 
(X(s),H(s),Y(s)) in the frequency domain. We 
have complicated convolution operations (*) 
in the time domain, and we have simple alge- 
braic operations (-) in the frequency domain. 
Picture source: Wikimedia, Licence: CCO 1.0 
Universal Public Domain Dedication. 


Figure 34. Fourier transform: a periodic func- 
tion in the time domain (f, red, magnitude 
over time) can be split into several sinus and co- 
sinus functions or several sinus functions with 
phase shift. These are peaks in the frequency 
domain ( a , blue, amplitude over frequency). 
Credit to Lucas V Barbosa, Wikimedia, Licence: 
Public Domain Dedication. 


time domain [) frequency domain 
o X# oscillation et x decay e~7 Imaginary (frequency) 
s-plane iw 
* 


i) Real (exponential) 


time time time 


original Fourier transform 


Laplace transform e-%e-i@t = e-(tiw)t — e-st 
Figure 36. Time and frequency domain: an ODE can be described by a number of sinusoids with phase shift or 
sinusoids and cosinoids. The process is known as the Fourier transform. This is not enough for control engineering, 
but if we represent the original ODE as a sum of sinusoids and exponential decay functions, it is sufficient. This 
process is called the Laplace transform, which allow us to work in the s-plane and frequency domain. 
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2. Basics of mathematics 


Before you go through the following chapter, consider 
watching this series of YouTube videos on calculus (3 
h): 

YouTube: Essence of calculus 


@ Solving quadratic equations 
p-q formula 
The solution of: 


x + px+q=0 


is: 


-SaV) 
X12 = ie 5 qd. 


a-b-c formula 
The solution of 


ax? +bx+c=0 


is: 
—b+ Vb? —4ac 
rn el 


@ Complex numbers 

Scientists had huge difficulties calculating the roots of 
negative numbers for a long time. The situation im- 
proved after defining i as the solution of the equation 
r=-l resulting in i? =—landi=W/—1. We get a com- 
plex number: 


z=atbi 


with a real part a and an imaginary part b. This is needed 
in this course, when you apply the p-q-equation. More- 
over, it is important to know that the imaginary part is 
directly linked to the sine function via Euler’s formula 
e'® = cos() +i: sin(@): 


z= re? 


= r(cos($) +i-sin(9)) 


et = elRe(s)t+ilm(s)t] 


= eRe(s)t . ellm(s)t] 


= eRe)" [cos(Im/(s)t) +i- sin(Im(s)t)] 


Watch Part 1 to 10 of this series on imaginary numbers 
to get a feeling: 
YouTube: Series on imaginary numbers 
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® Derivative 
The derivative is defined as: 
F +h)— f(a) 
/ = f (a 
F(@) ho h 

where the distance h becomes infinitesimally small. The 
derivative symbolizes the tangent at a function, as shown 
in Figure 37. 


f(xt+h) 


x x+h 
Figure 37. Derivative. Illustration of the derivative of a 


function is obtained. 


Differentiation 
y=x" 
d 
ay — y’ =n: xl 
dx 


Example 1: Differentiation 


3 


y=x y=4x4 


y= 3x" y= 16x? 


Product rule 


Example 2: Product rule 


y=xe 


y= 2x-eh4+x7-e 


Quotient rule 


_ u(x) numerator 
~~ Sx) denominator 
= u(x) - v(x) — u(x) - v(x) 

v(x)? 


Example 3: Quotient rule 


de (fet) (?)—(e) (f°) 
dx x? Cea 
_ (€*) G7) ~ (e*) (2x) 
x4 
_e(e-2) 
x3 


(Source: Wikipedia) 


Chain rule 
y=u(v(a)) 
with y = u(z) and z = v(x) 
Ray = BF a=W) =F 0@)') 


Example 4: Chain rule 


y = (2x+1)? 
y= 2(2x+41)-2=4(2x+1) 


@ Basic integrals 


An integral is basically the area under the curve (AUC). 
The term AUC is often used, e.g., in pharmarcokinetics. 


1 
[ea qe forn 4 —1 
n 


Ju dv=w— | vdu 
[eae 
1 


[inxax =xInx—x 


Transform one ODE of n'" order into n ODEs 
of 1% order 

Approach 

1. Introduce n new variables: 


Xj, = 
Xo =x 
x3=X 


(n) 


Xn = X 
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iCa. 63% (1 —e-1) = 0.63 


t=T t 
Figure 38. Time course behavior of an inhomogeneous 
ODE, given as Tx+x =u. The Euler number is e = 2.718. 


2. Write (n — 1) equations of the form: 
si=1...(n—1) 


Xj = Xi41 


3. Transform original ODE into 1 ODE of 1st-order by 
replacing the derivatives with x;. 


Example 5: Converting an ODE of n™ to n 1° order 
The equation of 2nd order is: 


¥+4%+5x=0 (n=2) 


With the ansatz 
| 
L= x2 = Xx} 
¥=%3 =X 


we obtain for our system: 


X] = X92 


X92 = —4xp — 5x1 


with the matrix form: 
x 1\ _ 0 1 XxX] 
Xo ~\e 5 —4 XQ 


®Transform two ODEs of 1 order into one ODE 


of 2™4 order 
Approach: 1. Solve one ODE for the other state variable: 


1 = f(x1,%2,u) — solve for x2 


2. Differentiate the solution. 
3. Insert into other ODEs. 
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Example 6: Converting ODEs from 1* to 2" order 
We begin with the equation set: 


Xy = aX] +aj2x2+ du 


Xq = a21X1 + A22xX2 + b2u 


The first equation is differentiated with respect to 
time f: 


1. 
Xo = — (41 — 411%) — bu) 
a\2 
; 1. : : 
x2 = — (#1 — ay xX — bu) 
ai2 


Transfer of the functions x2(x1,u) and %2(x;,u) to 
the second equation delivers: 


Ih ve . . 
— (| — ay1X1 — bu) =ar x, + bu 
a\2 
a7 ,, 
+ (x1 — ay1x1 — biu) 
ai2 


which in a nicer form gives: 


¥ = (ay) +.22)x1 — (a1, 422 — ay2a21) x1 
+byu+ (aj2b2 — a22b1)u 


B Linearization 

Biological systems are usually non-linear and the mea- 
sured effects rarely follow the law of additivity (Figure 
39). Still, the linearization of equations helps us for the 
following reasons: 


e Linear equations are mathematically easier to solve/ 
treat 


e Linear system properties are transferable to non- 
linear systems around the approximated area 


e A unique steady-state exists 
e They have a simple controller design 


e The response to a variety of simuli can be separated 
into linear combinations of the system down to 
individual stimuli. 


e Additivity f(x1,x2) = f(x1)+f(x2) (Figure 39) 
e Scalability c- f(x) = f(c-x1) (Figure 39) 


For the linearization, we can use the Taylor approxima- 
tion: 


f(s) =e(0") + 22 at) 
gee a) +2) |, es 


higher order terms often neglected 
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around a point of interest x*, which can be written as 
f(x) — g(x") = Ax = g'(x*)Ax 


whereby the difference between original function g(x*) 
and its approximation f(x*) is emphasized. A set of 
non-linear equations: 


can be approximated by the general form: 


ofg(x) 


Ox bas 


f(x) = g(x") + 


x* 


with the Jacobian matrix: 


ogi Ogi 
pee a 8 
OR Nae Va 
. I8m Ign 

Ox, OXp 


A linearized system in the general form is then: 
x = JAx. 


The Taylor approximation is a very important concept. 
It delivers a function that, with increasing terms, aligns 
with any continuous, smooth function around a certain 
point. However, if one moves too far away from the point 
of interest, the differences can be extreme. 

YouTube: Taylor series explained 


Because we are only interested in a linear tangent at 
a steady state x*, we neglect all higher order terms and 
look only at the function value plus the first term of the 
Taylor approximation: 


f(x) = a(x") + (e—2")8'(2") 


of x = x* + Ax with Ax = (x—x"*). 


non-linear 
F(x, + X2) # f (x1) + f (x2) 


linear 


f(x) F(x + x2) = f(%1) + Fr) f(x) 
f(%1 + x2) 
f (x2) 
£1) 
My X2 (x, + X) x 
linear linearisation 
x x 
ne fe) = Fm) ae 


x 3x, x x x 
Figure 39. Linear vs non-linear behavior. Linear systems 
show additivity and scalability. 


Tangent plane at 
PofXo, Yo: Zo} 


Curve C 
passes through 
PolXo: Yor Zo} 


Figure 40. Linearization of a multi-dimensional equation 
at one point with a plane. Source: LibreTexts Library for 
mathematics. If the point is an equilibrium point, the 


plane is horizontal. Source: LibreTexts Library. Licence: 


CC BY-NC-SA 4.0 (Fair Use). 


Example 7: Linearization 1 


First, we determine the steady state: 


g(x) =x =2x-1 
g(x) =*=2 
1 
= 2x* — = -—— 
X= 2x 0 >x 5 


Second, we approximate the function g(x) with a 


Chapter 3: The magic of change and how to find it — 35/56 


linear function f(x) at this point: 
f(x) =X = (2x* —1)4+2Ax 


1 
== (25-1) +2Ar=2Ax 


Example 8: Linearization 2 


First, we determine the steady state: 


t= (x*)2+2x*-1=05 

x —b+Vb*—4ac —2+V/444 
449 = — 

2a 2 


=-14+v2 


Second, we approximate the function g(x) with a 
linear function f(x) at this point: 


f(x) =% =((x*)? +. 2x" — 1) 4 (2x* +2) Ax 
For xf = -1+ V2: 


=*%=((-14+ V2)? +2(-14+V2)-1) 
+ (2(-1+ V2) +2)Ax 
=2V/2Ax 


For x3 = —1- V2: 


=*#=((-1- v2)? +2(-1- V2) -1) 
+ (2(—1—/2) +2) Ax 
=— 2y/2Ax 


The steady state at xj = —1+ V2 is unstable and at 
x= -1- \/2 is stable. See Figure 16. 


Example 9: Linearization 3 


First, we determine the steady state 


a 
s 
~™ 
II 
lI 
N 
S 
Ne 
| 
N 
a 
| 
an 
N 


% = 2(x*)? —2x*-12 40 
x _ + Vb? —4ac 2+V74+ 96 
“ie 2a 4 
i <5 


~ 3°" 9 


and obtain the steady states at: x} = 3 ; x, = —2 


Introduction to Systems Biology: Workbook for Flipped-Classroom Teaching 36/56 


Second, we approximate the function g(x) with a Example 10: Linearization 4 

linear function f(x) at this point: 

If we linearize a system with two non-linear equa- 
tions: 


f (x) =% =(2(x*)? — 2x* — 12) + (4x* — 2) Ax 
= %=0+4 (4x* —2)Ax 
gi(x,y) =x = (1—x—y)x 


For xj = 3: =—x7+(1—y)x 
=10Ax =—;* +xX—Xy 
x= —2x+1- 
For x5 = —2: 1 %y)s * 
81(%,y)y = —x 
=— 10Ax 82(x,y) =p = (1—x—y)y 
ey = 
The steady state at x} = 3 is unstable and at x, = —2 ce ae 
is stable. See Figure 16. =-y +y—xy 
82(x,y)x = —y 


g2(x,y)y = —2y+1—x 


g(x y) 


we approximate it with a plane at a particular point 
F (x,y) = g(x", y") +. 83.(x*,y*) -Avtgy(x*,y*) - Ay 


such as shown in Figure 40. But our points of inter- 
est are, as usual, steady-state solutions. For these 
steady-state solutions, the linearized system is: 


filx,y)=*% =O0+(1—2x* —y*)Ax—x"Ay 
fr(x,y)=4 =0+(1—2y* —x*)Ay—y*Ax 


Example 11: Linearization 5 


We want a linear approximation on a higher-order 
ODE 


X4+x+x=1 


on the steady state x* = 1. The equation is rewrit- 


Figure 41. Linearization of a function (hanging carpet) ten: 

in respect to two directions. The blue plane is horizontal 

at the steady state of function f. The red plane gives the g(x,%,*) =¥#¥+%+x-1=0, 

slopes if shifted slightly moving away from the steady 

state point P. and we treat any derivative as an independent vari- 
able. For: 


F(x, %,%) = 92") +8, 00") Ant g(x") -Ax+ ge(x*) Ax 


with 


we simply get: 


f(x, %,¥) = Ax + Aké+ A¥ =0 


Example 12: Linearization 6 


We aim for a linear approximation of a higher order 
non-linear ODE: 


g(x,%,%) =¥+44245=0 


With 


F(x, 4,4) = 90") +80") Axt g(x") Att ge(x*) Ad 


and with 
g(x") = 2x 
8:(x") = 1 
82(x") = 1 
we get: 


= 
_ 
a 


[2 


fai 


[5] 


F (x,%,%) =H" +2" +(x")? +5 4-2Ax+ AX+ A =0 
—eeuxoi~—qx 
=0 
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3. Exercises 


M@ Steady states 
Calculate the steady states for systems (3.1) to (3.8): 


x=2x-1 (3.1) 

g=x°42x-1 (3.2) 

x = sin(x) (3.3) 

K+xX4+x=1 (3.4) 

#4+2%+2°—5x+6=0 (3.5) 
d xXi\ 3. -—2 XxX] 1 

ala)-G S)la)rG)«  @@ 

x= (l-x-y)x 

y=(1-x—-y)y ae 
u=a—(b+1)u+uwv 

5 (3.8) 


v= bu-—u‘v 


@ Stability 
Test the following systems for the stability of their steady 
state(s). Draw, therefore, x over x. 


x = 3x (3.9) 
x= —5x (3.10) 
t=—x+0.5 (3.11) 
g=—342 (3.12) 


@ Eigenvalues and stability of the steady state 
Determine the Eigenvalues and the stability of steady 
states of following tasks: 


i & . i EA) (2) (3.13) 
a(-G DG) 3.14) 
a & = & i) (=) (3.15) 


Population growth 
The growth of a specific population is described with 
the following model: 
dX 
dt 


a) Derive the steady states of the system. 


= (2—X)(X—1)(X-K) 


b) Linearize the differential equation around the steady 


states. 


(withk >0 and X(0) =Xo) 
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c) Calculate the Eigenvalues of the linearized system 
in the steady states and determine thereof the sta- 
bility of the individual steady states. For which 
values of k do you get stable steady states? 


In the following, k = 5 holds: 


d) Mark within a X-over-t diagram the domains where 


dx . ie ‘ ‘ 
as positive, then respectively negative. 


e) Draw X(t) ina diagram using the results of parts 
a)-d) for different initial conditions of X. 


@ 1st-order system 
The growth of a specific population is described with 
the following model: 


dN 
dt 


=rN(2—N)(N-1) — (withr > 0 and N(O) =) 


a) Derive the steady states of the system. 


b) Linearize the differential equation around the steady 
states. 


c) Calculate the Eigenvalues of the linearized system 
in the steady states and test for stability of the in- 
dividual steady states. 


d) Mark within at, N diagram the domains where 


aN |, sie : 
a? positive, then respectively negative. 


e) Draw N(t) ina diagram using the results of parts 
a)-d) for different initial conditions of N. Also 
Calculate the position of the inflection points. 


@ Characteristic equation of a 2nd-order system 
The following characteristic equation of a 2nd-order sys- 
tem is given: 


(s—a+T)(s—1+aT)=0 (a,T are real) 


a) What is the corresponding homogeneous differen- 
tial equation? 


b) For which values of the parameters a and T do we 
have one single Eigenvalue on the stability bound? 


c) For which values of the parameters a and T is the 
system stable, metastable, or unstable? 


d) Draw the results of c) in a stability diagram with 
abscissa T and ordinate a. 


Hf Biomass growth in a bioreactor 

A batch fermentation is carried out in an ideally mixed 
tank reactor. At the beginning, substrate and biomass 
is provided with concentrations Sp = S(t = 0) and Xp = 
X(t =0). 

The reaction volume of the reactor (Vp) is considered 
to be constant. The specific growth rate of the microor- 
ganisms shall be . The yield coefficient? shall be Yxs. 
The biomass is linearly proportional to the decline in 
substrate X = —YysS and this anti-correlating behavior 
is thus a measurement of efficiency. 


a) Set up balance equations for biomass (X) and sub- 
strate (5S) concentrations. 


b) Derive S(t) as a function of X(t), Xo und So. 


c) Derive a differential equation for X which does not 
depend on the substrate concentration anymore 
[X = f(X)] using the assumption p = cS. 


d) Repeat this while assuming a logistic growth rate 
be =a-—bxX with (a,b>0). 


e) Compare the differential equations obtained for X 
inc) and d). What would a and b have to be in 
order to obtain identical differential equations? 


f) Calculate the steady states (X*,S*) of system d) 
g) Show that the following ansatz is a solution for X(t): 
— x* 

~ T+ce7e2t 


X(t) 


How do you have to choose c; and c2 (c1,c2 #0)? 


(3.16) 


@ Prey-Predator Model 1 

We have two versions of the Prey-Predator Model of 
Lotka and Volterra. Prey X grows independently of 
Predator Y and is depleted by interacting with Y. Thereby 
Predator Y grows and, in turn, is depleted by a natural 
death rate. The most simple differential equation model 
reflecting these relations is: 


X =X(1-Y) I 
Y =Y(X-1) II 


a) Assign the facts described above to the individual 
model terms in the system. 


b) Calculate the steady states of the system. 


c) How can the behavior in vicinity of the steady 
states be approximated (without calculations)? 


° "Yield based on substrate (Yx/s) or oxygen consumption (Yx/o) 
is a very important parameter. This parameter indicates how efficient 
a fermentation is. At the same time it is very closely related with 
the maintenance coefficient (m). By means of yield and maintenance 
coefficient it is possible to estimate the proportion of energy that cells 
consume in biomass and metabolites synthesis and the proportion 
of energy that allows the cells to maintain their capability for their 
biological performance." Direct quote [?]. 
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@ Prey-Predator Model 2 

In the following, a more complex Lotka-Voltera model 
shall be investigated. The growth rate of the prey is in- 
cluded with a parameter pt > 0 and the predator’s growth 
rate is modeled with a yield coefficient a > 0. Further- 
more an influx of predators into the system is assumed. 
The resulting model description is: 


X =X(u—-Y) I 
Y=Y(ax-1)+1 I 


a) Assign the facts described above to the individual 
model terms in this system. 


b) Calculate the steady states of system as a function 
of parameters @ and LL. 


c) Which condition has to be fulfilled for parame- 
ter L to obtain only biologically reasonable steady 
states? 


d) Draw the steady states for u =2 and a = [!/s; 1/4; 1/3; 1/2; 1] 


in a diagram (x-axis: X, y-axis: Y). 


 Prey-Predator Model 3 (fish) 

Small and big fishes are living in a lake. The small fishes 
(X) depend on plankton which is available in excess. Big 
fishes (Y) feed on the small ones. If only small fishes 
were present, it would grow exponentially, i.e. their num- 
ber grows with a constant specific growth rate a (a > 0). 
In the absence of small fishes, the big fishes would die 
out with a decay rate B (B > 0). The following sim- 
ple equations describe these facts using the parameters 
a,B,y,6 > 0: 


d 
Ge (4 WX I 
d 
ae = —(B —6X)Y II 


a) Determine the steady states of this simple ecologi- 
cal model. 


b) Linearize the differential equations around the steady 
states and transform the linearized equations into 
state-space description (states, inputs, outputs in 
matrix/vector representation). 


c) Determine for every steady state the Eigenvalues of 
the linearized system and deduce from that the sta- 
bility of the system in the vicinity of the respective 
steady state. 


 Prey-Predator Model 4 (fish) 

The previously introduced fish population model is mod- 
ified as follows: it is assumed that the small fishes (X) are 
furthermore depleted via social attrition effects within 
their own population. This depletion depends on the 
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probability that two small fishes meet. It is therefore 
modeled proportional to X?. The extended population 
model reads as follows with a = B = y= 6 = 1;e =0.5: 


d 2 

Ape me WY) — €X I 
d 

rAd = —(B —6X)Y II 


a) Determine the steady states of the system. 


b) Linearize the differential equations around the steady 


state and transform them into state-space descrip- 
tion (states, inputs, outputs in matrix/vector rep- 
resentation). 


c) Calculate the Eigenvalues of the linearized system 
at the steady states and determine thereof the be- 
havior of the system in the vicinity of the steady 
states. 


d) Determine the equations of the vertical and hori- 
zontal tangents. 


e) Optional: Determine the rectilinear trajectories. 


f) Sketch the phase portrait. 


B@ Reaction system 
The following reaction system is given: 


wy, Ms, ox 


X+yY oy 


ay 8s B 


The concentration of substance A shall be constant. The 
reactions take place in an ideally mixed enclosed reactor 
with constant volume. 


a) Give the balance equations for the amount of sub- 
stance (mol) of the reaction partners X and Y. 


One obtains the following differential equation system 
by specifically choosing the reaction constants kj, k2,k3: 


dcx 


ae = cx(1—cy) 
d 
A = cy(cx—cy) 


b) Determine the steady states of the system. 


c) Linearize the differential equations around the steady 


state and transform them into state-space descrip- 
tion (states, inputs, outputs in matrix/vector rep- 
resentation). 


d) Calculate the Eigenvalues of the linearized system 
at the steady states and determine thereof the be- 
havior of the system in the vicinity of the steady 
states. 
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e) Determine the equations of the vertical and hori- 
zontal tangents. 


f) Optional: Determine the rectilinear trajectories. 


g) Sketch the phase portrait. 


Notes 
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4. Solutions Task 3.6 


Do not betray yourself! d @ ) 6 7 (":) 6 
= + u 
Exercises dt \x2 1 5 x2 0 


Steady states 
Calculate the steady states for systems 3.1 to 3.8: 


is: 


3x, —2x2 +u=0 I 


Task 3.1 ee ee I 
x= 2x—1 He SS ye Ssu 
0=2x-1 . 

in I: 
g 1 
x= 
2 —15x2 — 2%. +u=0 
Task 3.2 —11x. = —u 
P 1 
x» =u 
x¥=x°+2x-1 2 7 
0O=x*+2x-1 
Back in II: 
You need the p-q equation (learn it by heart!) and then 
you get: a 
xy+ 7" 0) 
5 
2 2 = — X] S — == 
tg S=rszt = 1=-l+v2 
M2=-5 (5) + Vv 17 
Task 3.3 


x*=nn for:neZ Z = {...3—2;—1;5031;2;...} or the steady state (—5/12u, —1/17u). 


Works for integers Z. Task 3.7 
Task 3.4 x= (1-—x-y)x I 
rae y=(1-x-y)y II 
¥4+xX+x=1 
x =1 I: => O=(1—x-y)x 
Case 1 
Task 3.5 
¥42¢4+27 —5x+6=0 =>|xt =0 
x —5x+6=0 in Il: 
(x—2)(x-3) =0 (1—y)y =0=[¥f =O] and [¥z = 1] 
x} =3 
i =2 The steady states are (0;0) & (0;1). 
Alternatively, with p-q equation: Case 2: 
5 ag (l—x 
t,=544/(=] - —y)=0 
X12 5> (5) 6 a 
x=1- 
= a. h yH=1l-x 
ae 


in II: 
(l—1+y—y)y =0 
0-y=0 — always true 
y* =c any desired value 


With this we have the steady states (1 — y;c) & (c;1—x) 
which includes the steady state (0; 1). 


Task 3.8 


u=a—(b+1)utuyv 


v= bu—wvy 


becomes: 


u=ve—(b+l)uta=0 I 
v =u(vu—b) =0 Il 


Case 1: 
Il if u=0 
inI = Sif a#0 

Case 2: 

I if uO 

b 

> v=- 

u 


=> bu—u—bu+a=0 


with the two solutions: 


@ Stability 

Test the following systems for stability of their steady 
state(s). Draw, therefore, x over x. See Figure 42. 

Case 1: Stable 


* 


if x>x 
then x<0O 
or 
if x<x* 
than x>0 
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x = 3x x= —5x 
5 5 
Oe is a, cece a “> stable 
KX] eae 8 x 
ie 
Ss. 
5 5 
1 0.5 0 0.5 1 1 -0.5 0 0.5 1 
x x 
1 
k= -x +5 
0.5 2 


x x 
Figure 42. Plot of change of x over state x. Stable: if the 
state x is positive, the change x makes it more negative. 
Unstable: if the state x is positive, the change x makes it 
even more positive. 


Case 2: Unstable 


if x>x* 
x>0 
or 

if x<x* 


x<0 


@ Eigenvalues and stability of the steady state 
Task 3.13 


d xX1\ —l1 4 XxX] 
dt \x2) \ 0 —3) \x.2 
sl =—1;s2 = —3: stable 


Task 3.14 


d xX1\ 3 8 XxX] 
dt \xo.) \1 1) \x 
sl =—1; 52 =5: unstable 


Task 3.15 


d X\ [7 1 3 XxX] 
dt X2 ~ le 1 1 XQ 
sl =iV2; s2 = —i/2: metastable oscillations 


@Population growth 
The growth of a specific population is described with 
the following model: 


dX 
dt 


=(2—X)(X—1)(X—-k) — (withk>0 and X(0) = Xo) 
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Taska) Derive the steady states of the system. 


O-—2)@ =a") <0 
> ET) =| HEH 


Task b) Linearize the differential equation around the 
steady states. 


X = (2X —2— X74 X)(X-—k) 
= (3X —2—X?)(X —k) 
= (3X? — 3kX — 2X + 2k —X° + kX?) 
= —X3+X7(3+k)+X(—3k—2)+2k 
AX = [—3(X*)? + 2X*(3+k) — 3k—2] -AX 
Task c) Calculate the Eigenvalues of the linearized sys- 
tem in the steady states and determine thereof the sta- 


bility of the individual steady states. For which values 
of k do you get stable steady states? 


Steady State 1 (X* = 1): 


=> s=-3+42(34+k)—3k-2 
= —3464+2k-3k-2=1-k 


=> stable for k > 1. 
Steady State 2 (X* = 2): 


=> s=—12+44(34+k)—3k-2 
=k-2 


=> stable for k < 2. 
Steady State 3 (X* =k): 


> s=—3k4+2k(34+k)—3k-2 
= —3k° +6k+2k —3k—2 
=k? 4+3k-2 


=> stable 


s=—-kP4+3k—-2<0 
ke —3k+2>0 
(k—1)(k—2) >0 


Solution 1: k-1>0 OR k-—2>0 
=>k>2 

Solution 2: k-1<0 OR k-2<0 
=k<l1 
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Task d) and Taske) Mark within a X-over-t diagram the 
; dx . PD 
domains where as positive, then respectively nega- 


tive. One parameter is fixed k =5. Draw X(t) ina dia- 
gram using the results of parts a)-d) for different initial 
conditions of X. One parameter is fixed k = 5. 


@ 1st-order system 
The growth of a specific population is described with 
the following model: 
dN 
dt 


Taska) Derive the steady states of the system. 


& = rN(2—N)(N-1) =0= [Ni =o} [Ns = 1}[N3 = 2] 


Task b) Linearize the differential equation around the 
steady states. 
The equation can also be written in the following form: 


(with r >0 and N(0) =No) 


= rN(2—N)(N-1) 


N=rN(2N—2—N?+N) 
= rN(-N? +3N—2) 
= —rN? +3rN? — 2mN. 


With only small deflections n, one can linearize ODEs 
around the steady states: 

N=N;+n for i=1,2,3 
According to Taylor: 


Of(N) 
ON 


d ok 
© (N* +n) = FON) [yp to Ip mtn. 
SS 


dt 

=0 

After the implementation of the non-linear equation into 
the Taylor equation, one obtains: 

d 


—(N* +n) = 


—r(N*)? + 3r(N*)* —2rN* 
dt ———___,- —_____ 


0 
+ ((—3(N*)? + 6N* 


2)r)-n 


d 
Steady State 1: Nj =0—- Fa —2rn 


d 
Steady State 2: Nj =1— —n=rn 


dt 
d 
Steady State 3: N3 =2— a —2rn 
Task c) Calculate the Eigenvalues of the linearized sys- 


tem in the steady states and test for stability of the indi- 
vidual steady states. 


Steady State 1: The Eigenvalue is s=-2r and is negative 
because r > 0. The Steady State 1 is thus asymptot- 
ically stable. 


Steady State 2: Because the Eigenvalue s=r is always 
positive, the Steady State 2 is unstable. 


Steady State 3: Similar to Steady State 1 and thus asymp- 
totically stable. 


Task d) Mark within a t— N—diagram the domains 


dN , oe . : 
where a is positive, then respectively negative. 


N 


In the second plot (N over N) one can see very well 
where the derivative becomes negative or positive. 


Task e) Draw N(t) in a diagram using the results of 
parts a)-d) for different initial conditions of N. Calculate 
also the position of the turning points. 

For the turning points, we need the second derivative: 


d? d d 
ae a” 
=r|(2—N)(N—1)-N(N-1)+N(2—N)|N=0 


This solution of the equation gives us only trivial solu- 
tions N = 0. This does not mean that the steady states 
are inflection points at the same time, because the third 
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derivative is zero for equilibrium points. To sketch the 
curves we are only interested in the solutions: 


r[(2—N)(N—1)—N(N—1)+N(2—N)] =0 
—~3N*+6N—2=0 
N? —2rN +.2/3r =0 
N? -2N +2/3=0 


With: 


2+ 4— B_14¥3 
2 3 
=> nj, = 1.58, np =0.42 


=> Ni2= 


we obtain the solution: 


N=1+— 


Now one can sketch some solutions for the time-course 
behavior with N(t) over ¢. 


@ Characteristic equation of a 2nd-order system 
The following characteristic equation of a 2nd-order sys- 
tem is given: 


(s—a+T)(s—1+aT)=0 (a,T are real) I 
Taska) What is the corresponding homogeneous dif- 


ferential equation? Characteristic equation with formal 
ansatz: 


s°+(T —a+aT —1)s+(T—a)(aT -1)= 
s?+(T—1)(1+a)s+(T—a)(aT -1)= 


Homogeneous ODE: 


¥-+(T —1)(1+4)%+ (T —a)(aT —1)x=0 
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Task b) For which values of the parameters a and T do 
we have one single Eigenvalue on the stability bound? 


From Equation I we get: 


Sj =a—T 
So = 1-—aT 


An Eigenvalue lays on the stability margin, if: 


=T|)>5,=0, s.=1-T*40 TAH 


1.) 
1 1 
2.) Je=F]>s=0 s1=7F0 


Task c) For which values of the parameters a and T is 
the system stable, metastable, or unstable? 


| 
_ 


=TFHt 


1.) Asymptotic stability exists for s1} <0 and s2 < 0. 


a-T<0O and 1l-—aT<0 —~ aT>I1 


1 

Case 1: a> AND T>0 
1 

Case 2: a<a AND T <0 


2.) Metastability exists if one Eigenvalue lies on the 
stability margin and the rest is negative. 


a<T 


a) a=T and T?>1 +[T<-1 ORT>1 
1 1 


Case 1: |7 >0 AND T>1 
Case 2: |T <0 AND T>-1 


3.) In the remaining cases we have no stability. 


Task d) Draw the results of c) in a stability diagram 
with abscissa T and ordinate a. 


unstable 


stable 


meta stable 


stable 
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Biomass growth in a bioreactor 


Task a) Set up balance equations for biomass (X) and 
substrate (5S) concentrations. 
X = ux I 
_ 1 
— Yxs 
. 1 
S=—-—px II 
Yxs . 


Task b) Derive S(t) as a function of X(t), Xo, and So. 


Equation I > [ . = i Ldt 

[In(X Ix, = [Mt] 

In(X) —In(Xo) = [1-15 
In(X/Xo) = bt 


1 
Equation II > ° aa - Xo: elt 
xs 
1 
dS = —>—- Xo el dt 
Xs 
S t 
as=—-E%. | et! . dt 
So Yys Jo 
Ss HX 1 t 
Sls = ye FF [e"To 
Xo 
S—So= eM _ | 
o=—y¥ ) 
1 
$= S)- ~ (X(t) — Xo] 
Yxs 
or simply: 
dS 1 
— = + px 
dt Yys e 
4 ae 
a Yys dt 
dS = _— -dX 
xs 
S(t) 1 X(t) 
[=f & 
So Yys JX 
S 1 xX 
si? 7 ee . hae 
1 
S(t) — Sp = Yes [X(t) — Xo] 
1 
a Oe [X(t) — Xo] 


Taskc) Derive a differential equation for X which does 
not depend on the substrate concentration anymore [X = 


f(X)] using the assumption pt = cS. 


dX 
— = UX = cSX 
Yo 
1 
= c[So — a [X (t) — Xo]|X 
XS 
Task d) Repeat this while assuming a logistic growth 


rate U = a—bxX with (a,b>0). 


“ = UX = X(a—bX) = aX — bx? (4.1) 

dS 1 
= -X bX 4.2 
Fy KlA- ON) (4.2) 
Taske) Compare the differential equations obtained for 


X inc) and d). What values would you have to choose for 
aand b in order to obtain identical differential equations? 


xX Xx X 

eee Kea eS er, ey en 

dt Yys Yxg —~_—’/ 

Task d) 
Task c) 

X, Xx 

o=x| 5+ S| = X [a— bX] 
Yxs Yxs 


A comparison of coefficients delivers: 


X( 
+a=e(s+ 92) 
Yxs 


Cc 


=>b=— 
Yyx5 


Task f) Calculate the steady states (X*,S*) in task d). 


0 = (a—bX)X (4.3) 
— bx 
(ee (4.4) 
Yxs 
Equation 4.3 delivers two solutions: 
a 
x; =0 .e— 
1 2-35 
Inserting into Equation 4.4 
a—b0 a—bta 0a 
O=- 0 and 0= b ps 
XS Yys b Yys b 


shows that the substrate does not matter and that the 
steady-state values for X are valid for any substrate con- 
centration. The solution does not say anything about the 
dynamic until the steady state. One could say that either 
the reactor content is dead at the end or the growth and 
death rate balances the amount of individuals. Have 
in mind that the logistic growth equation does not de- 
pend on the "substrate" and it is the responsibility of the 
modeler to set up realistic but still simple equations. 
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Task g) Show that the following ansatz is a solution for 
X(t): 
x* 
X(t) = ——_—_ 
l+cje ot 


What values would you have to choose for c; and cz? 


The equation: 


dx 
— =(a—bX)X 
dt c ) 


has X itself and its derivative. We do not have the deriva- 
tive of the ansatz yet. Let’s do it! Therefore, we need the 
quotient rule: 


(£) _ fis-fe' 
g g 
which is applied on our ansatz: 

dX(t) _ (0. Ane) — (X*(0+cicpe~)) 


dt (1+ cye7©2!)2 


cot 


X*cjo.e 
( +cje72!)2 


Now we have an equation for X and X. We can insert the 
ansatz into Equation 4.1. 


dX 
X*ceyone 62! b x* x* 
=a 
(1+. cye7©2!)? l+cype—! } 14+cye7o2t 


and multiply with (1 + cje~©")?: 


» x" ) X*(14tcye-@)t 
1+cye~°! 1 2 
X*cyone = (a(1 +cye 2") —b)x* 
0 = (a(1+cie @) — bX* — cic2e ")X* 
= (a—bX* + (a—c2)cje ) X* 


X*cjoe % = (« 


Check the steady states 
Let’s test the first steady state [x* =o} 


0= (a—b-0+(a—c2)c1e~) -0 


Well, this equation is always true. But we are rather 
interested in the time response. 


a 
X*=— 


b 
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We could try to get more out of the ansatz by checking 

what happens at time point zero t = 0. 
x* x* 

| + ee 029 =o +c 


X(t =0) =Xo (4.5) 


which tells us that with the steady state X* = 0: 


0 
X — 
a 1l+c 
the equation is only valid if X) = 0: So it means that 
if we do not have any micro-organisms in the tank at 
the beginning, we will also have no micro-organisms at 
the end and stay in the steady state X* = 0. Not really 


interesting, is it? Remark: this steady state is unstable. 


Let’s do something useful by checking the second steady 
state. 


We introduce =F 
a? (asoe + (a—en)cre"") ; 


with which we know that: 
aQ=a 


We insert this into Equation 4.5: 


a 
5 
0 l+c) 
a 
l+q=2 
+cy Xo 
a 
ei 
oh BX 


Inserting this into our ansatz 


X* 
a 
(#) l+cje ot 
we obtain a final solution: 


7 alt 
X(t) = iia & _ 1) _ 
bXo 


Isn't this beautiful? 
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@ Prey-Predator Model 

We have 2 versions of the Prey-Predator Model of Lotka 
and Volterra. Prey X grows independently of Predator Y 
and is depleted by interacting with Y. Thereby Predator 
Y grows and, in turn, is depleted by a natural death rate. 
The most simple differential equation model reflecting 
these relations is: 


X =X(1-Y) I 
Y =Y(X-1) II 
Task a) Assign the facts described above to the individ- 


ual model terms in the system. 


birth Predator eats prey 


X= xX ~YX 
Y= YX 4 


growthonprey death 


Prey number change: 


Predator number change: 


Task b) Calculate the steady states of the system. 
X =x*(1-Y*)=0 I 
¥ =Y*(xX*-1)=0 Il 


Case differentiation: 
Equation I =|X} =0) 
in Eq. I: Y*(0—1)=0> 
Equation I =| =1] 
in Eg. I: 1-(x*-1)=0>[x*=1] 


Task c) How can the behavior in vicinity of the steady 
states be approximated (without calculations)? Calcu- 
Cases 0<Y<1;/Y>1 
0<X<l1 xX>0 |x<0 
Y<0 | Y<0 
X>1 X>0 | X<0 
¥>O |:F>0 


lation close to the steady state at the zero point: (0.1 | 
0.1): 


= X =0.1(1—0.1) =0.09 
¥ =0.1(0.1—1) = —0.09 
Phase plot with oscillating system 


@ Prey-Predator Model 2 

In the following, a more complex Lotka-Voltera model 
shall be investigated. The growth rate of the prey is in- 
cluded with a parameter ps > 0 and the predator’s growth 
rate is modeled with a yield coefficient a > 0. Further- 
more, an influx of predators into the system is assumed. 
The resulting model description is: 


X= XT) (4.6) 
Y = Y(a@X-1)+1 (4.7) 
Task a) Assign the facts described above to the individ- 


ual model terms in this system. 


predation 


—YX 
ayYx —-Y 


growthon prey death 


. birth 
X = Ux 
4 = 


Prey number change: 
Predator number change: 


Task b) Calculate the steady states of system as a func- 
tion of parameters @ and p. 

X=X*(w-y*) +0 I 

¥ =Y*(aX*—1)+1=0 Il 


Case differentiation: 


Equation I =|X} =0] 
in Eq. I: Y*(0-1)+1=05|¥f =1] 
EquationI +(u—Y*)=0 =| =u 
in Eq. Ik p-(ax*—1)+1=0 
bax* —u+1=0 


u-1 
OL 


=>|xX*= 


Task c) Which condition has to be fulfilled for parame- 
ter p1 to obtain only biologically reasonable steady states? 
We assume it is reasonable to always have enough of each 


! 
species in in the system X*,Y* > 0 then: 


y*>0 = u>0 
x*>0 = = >0 
aU 
u-1>0 
> 1 
Task d) Draw the steady states for u = 2 and a = [!/s; 


1/4; 1/3; 1/2; 1] ina diagram (x-axis: X, y-axis: Y). 


immigration 


Chapter 3: The magic of change and how to find it — 49/56 


* =1 1 * 
; 5/2 2 
1 2 2 
; 3/2 2 
: 1 2 
1 1/2 2 


Plot with parameter-dependent shift of the steady state: 


@ Prey-Predator Model 3 (fish) 

Small and big fishes are living in a lake. The small fishes 
(X) depend on plankton which is available in excess. Big 
fishes (Y) feed on the small ones. If only small fishes 
would be present, it would grow exponentially, i.e. their 
number grows with a constant specific growth rate a 
(a >0). In the absence of small fishes, the big fishes 
would die out with a decay rate B (6 > 0). The following 
simple equations describe these facts using the parame- 
ters a, B,y,6 > 0: 


d 
—X= — yY)xX = I 
ae (a@—y)X =0 
d ! 
—Y =-—(B—6xX)Y =0 I 
“y = —(6 -8X) 
Task a) Determine the steady states of this simple eco- 


logical model. 


Equation I =|x7 =0] 
in Eq. I: —(B—60)¥ =0=>|¥;/ =0] 


Equation] >(a—yY)=0 > 


in Eq. Ik — (B— 8x) =0 
a a 
x Se 
y Py 


._B 
a 


Task b) Linearize the differential equations around the 
steady states and transform the linearized equations into 


Introduction to Systems Biology: Workbook for Flipped-Classroom Teaching 


state-space description (states, inputs, outputs in ma- 
trix/vector representation). 


We assume: 
X=X*4+x 
Y=Y*+y 


The right-hand side of the non-linear equation system 
will be linearized around the steady states. If one de- 
notes the right-hand side of the first ordinary differential 
equation (ODE) as function of X and Y, one can use the 
Taylor approximation: 


d d 


dg(X,Y) 
oY 


eae [ie ses 


Applied on our system, we obtain: 


{x= (a@— yY*)-x— yxX*-y 


d * *), 
ye Or eB Ox" )*y 


The related state-space representation is: 


x\  (a—yy* —yx* x 

y}) \ 6yY* —B+6xX*] \y 
Steady State 1: X/ =0,Y;, = 0. The steady-state values 
are inserted into the state-space representation: 


G=(6 “9)G) 8) 
Steady State 1: X} = 5.15 = $ 
Cee ars 5 


Task c) Determine for every steady state the Eigenval- 
ues of the linearized system and deduce from that the 
stability of the system in the vicinity of the respective 
steady state. 

Steady State 1: 

Eigenvalues directly from the related state-space repre- 
sentation 4.8: 51 = @,s2 = —B — steady state is unstable 
Steady State 2: 

We use the state-space representation 4.9 to get the char- 
acteristic equation: 


0 ~y8 


32 9 =s*+aBp =0 
Y 


|s1—A| = | 


Thus, we obtain the complex Eigenvalues: 


Si2=ty —ap = +i,/aB 
Thus, we have a metastable steady state with conserved 
oscillations, also known as a limit cycle. 
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 Prey-Predator Model 4 (fish) 


Hepes ee 2 
go(xy)=y=(x-ly =-ytay 0s 
Task a) Determine the steady state. 
From IL: 

(x—1)y=0 [yi =0] 

Hy =1 

y, =OinI: 

x(1— =x) =0>}x7 =0 

x3 =2/>]})93 =0 

x5 = link: 

l-y : =0>]y) = ; 
Taskb) Linearize around the steady state with the equa- 
tion 


F (x,y) = g(x", y*) + ay (x* y* Ax + g(x", y*)Ay 
using x = x* + Ax with Ax = (x—x*). 


1 
= filx"y") = @"-2y"— 5") + (l-y" "ae 
y= falx*,y") = (-y* +2" y") + ("Art (x* — DAy 


(—x")Ay 


which is in the general state-space representation: 


_x* Ax @ x* — x*y* — 5 (x*)? 
x*—1} \Ay —y* +x*y* 


(5) _ (’ —y*—x* 
y y 
—>—_—__ 
Ja 28) 


ix 


and for Steady State 1 (x = O|y = 0) — (00): 


()=( 4) (3): 


Steady State 2 (1|4): 


Task c) Determine the stability with the Eigenvalues: 


|sI—A] = S—a]] a12 
a2] S— 422 


! 
= (s—aj1)(s — az2) — ay2a2; = 0 


For Steady State 1 (0]0): 


(s—1)(s+1) =0 


This characteristic polynomial delivers s; = 1, s2 = —1, 
which indicates an unstable saddle point. 
Asymptote: 
We start with the equation in the state plane: 
eee 
XxX dx x 
and introduce the linear equation y = mx and obtain a 
equation 


—mx 


m= 
X 


that only works for m = 0 


Steady State 2 (1|4) delivers the characteristic equation: 


1 a oe oe 
(sta)-st5 =s +58 5 =0 
2s? +5+1=0 
-ltvI-8 1, v7 
S1/2= = +i 
4 4-4 
which indicates a stable focus. 
Asymptote: 
We start with the equation in the state plane 
y_ dy _ ax 
xX dx — 5x —y 


and introduce the linear equation y = mx and obtain the 
equation 


m= - 
— 7X — mx 
1 2, 
——MX—- MX = =X 
2 2 
1 
O=m’?+—m+-— 
a aie 


1 1 1 -ltvV-7 
m2 = £4/ ~ 
4 16 2 4 


which delivers imaginary slope values. 
Steady State 3 (2|0) result in the characteristic equation 


(s+1)(s—1)—-0=0 51 
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and consequently is an unstable saddle. 
Asymptote: 
We start with the equation in the state plane 


y_dy__y 
c —x—2y 


and introduce the linear equation y = mx and obtain a 
equation 
MX 
m= —W— 
—x —2mx 
—mx —2m?x = mx 
—2mx —2m*x = 0 
2mx(1+m) =0 


which result in the slope values m; = 0 and mz = —1 


Taskd) Tangents 
To draw our phase plot, we calculate horizontal and 
vertical asymptotes with: 


e horizontal tangents: y =0in Equation II: 
> x=1,y=0 


e vertical tangents: x =0in Equation I: 
> x=0, y=-5x+1 


Taske) Rectilinear tangents 
10 11, 


Ansatz :y=mx+b 

d - —mx—b 2 +b 

OF peas — _ nx +mx zs x (4.10) 
dx x—xy— 5x2 x— mx — bx — 5x" 


10 Ansatz: x = my+b 


dx x—xy— 52° 
=e 
dy ~ytxy 
2 By eae 2 
__ my+b—my~ — by— x|m*y* + 2mby + b*] 
—y+my* + by 


1 1 
=> —my+my* + mby— my —b+my* +by+ sy +mby+ a” =0 


=> (m* +m+ 1 DP + (b) y+ (<b? —b) =0 
2 Ww 


Y_~_—_ 
0! ” 0! 


Coefficients: 
b, =0>m =0>[k=0] 
3 


] 
by =2 +m =1 rattles 0 2 


11 Ansatz: x =b 


=> |x=O0}. 


b=0 (forally) 
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bmx mx? = —mx — b+ mx? + bx 


2 


>mx mx? 


202) 


=>-—mx — zim? — mx? + mx — bmx + mx — bx +b = 0 


3 
m—m?)x° + (2m—bm—b) 


=> ( 5 x+ b> =0 
Coefficients: 
=[b=0] 
=>2m—0-m—0=0— 2m=0>[m=0] 
3 
Inserted in Equation 4.10: 
for m =0> 
3 
for mM, = i 
3 5x-5e 
2 x+x2 
ae 
“273 7 
Task f) Draw the phase plot. 
The phase plot can be seen in Figure 43. 
0 0.5 1 15 2 25 3 


Figure 43. Phase plot: Red dots are steady states of 
the system. Dotted lines are the vertical and horizon- 
tal tangents. Initial values are: (x = 0.2,y = 2) and 
(x =2,y =0.2). 
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@ Reaction system 
Task a) Balance equations for X and Y. 
The balance equation can be written directly 


A=0 

X = —k\AX + 2k, AX —koXY = X(kiA—koY) 
Y = —kyXY + 2k XY — 2kgY? = Y (kyX — 2k3Y) 
B=+k3Y? 


whereby one has to carefully pay attention to the coeffi- 
cients. Or one uses the network approach learned in the 
previous section: 


V1 v2 V3 

-130 0 0 |A 

N= |-1+4+2 -1 0 |x 

0 —1+2 -2/Y 

0) 0 1 |B 

which results in: 
‘ © 8) tax 
.| =S=Nv= kyXY 
4 0 1 =2) |e 
B 0 0 1 : 


This way might be less error prone in praxis. In both 
cases we obtain: 


X =X(kjA—kY) 
Y =Y(koX —2k3Y) 


where A is kept constant, and B does not influence the 
behavior of X or Y. Because A is constant, we could also 
combine it with k; to another constant kjA =C. Also 
k3 could be updated 2k; = k,. In the following we do 
not want to drag the parameters on and only study the 
system: 


X=X(1-Y) 
Y =Y(X—-Y) 

Task b) Determine the steady states of the system. 
g(X,Y)=X=X(l-mhY) +0 1 
g2(X,Y) =Y¥ =¥(X-Y) =0 I 


Steady State 1: 


pi=0] ize 


Steady State 2: 


in II 


Task c) Linearize the differential equations around the 
steady state and transform them into state-space descrip- 
tions (states, inputs, outputs in matrix/vector represen- 
tation). 

The general form is: 


F (X,Y) = g(X*,¥*) +.9,.(X*,Y*)AX + 9) (X*,Y*)AY 
sometimes also written as: 


AX = fi(X,Y) — g1(X*,¥*) 

= 81,(X",Y")AX + gi, (X*,¥*)AY 
AY = f2(X,Y) — g2(X*,¥*) 

= 82,(X",Y")AX + go,(X*,¥*)AY 


using X = X*+AX with AX = (X — X*). In our example, 
we obtain: 


X= fi(X",¥") =X"(1—-Y") + (1-Y¥"*)Ax+ (—X")Ay 
Y= fo(X"¥") = V(X" —¥") + (¥")Axt (X* — 2Y")Ay 
which is in the general state-space representation: 
Re flaw. -ae Ax\ , ( X*(L-¥") 
yh NY ROR) Ay ¥*(X*=¥") 
SS 


— 98x) 
Ja Se 


x 


Steady State 1 (x* =0,Y* =0): 


Steady State 2 (x* =1,Y* =1): 


? 


Task d) Calculate the Eigenvalues of the linearized sys- 
tem at the steady states and determine thereof the be- 
havior of the system in the vicinity of the steady states. 


|sI—A| = S—aj\ a\2 
a2) S— a2 
! 
= (s—ay1)(s—ap2) — aj2a21 = 0 
Steady State 1: 
s—-1 0 
|sI—A| = ae 


= (s—1)s=0 


Chapter 3: The magic of change and how to find it — 53/56 


with the Eigenvalues: 


sp=l 
52 =0 


The positive Eigenvalue indicates an unstable steady 
state. Steady State 2: 


|sI—A| = 


Ss 1 
—1 s+l1 

! 
=s(s+1)+1=0 
= +541 


The quadratic equation is solved: 


ltVJV1-4 1 i 
fig pts 


and results in 2 complex numbers with negative real part. 
Thus, we have a stable focus. 


Task e) Determine the equations of the vertical and 
horizontal tangents. 


Horizontal tangent: 


¥=¥(X-Y)=+0 
+ (Y=0} 
Vertical tangent: 


! 


x=X01=Y¥)=0 
= [K=0} 


Task f) Optional: Determine the rectilinear trajectories. 
1213. 


Ansatz’-~: y=mx+b 


12 Ansatz: X = mY +b 


dx ae mY +b—(mY +b)Y 
dY (mY +b)Y —Y? 

= mY +b— mY? —bY 
mY2 + bY —Y2 


=> mY? + bmY — mY? —mY —b+mY? +bY =0 


Y?(m* —m+m)+Y¥(bm—m+b)—b=0 
13 Ansatz: X =b 
X=0 
X=0=X(1-Y)=b(1-Y) 
=>b=0, |x=0 
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2.5 


Figure 44. Phase plot: Red dots are steady states of 
the system. Dotted lines are the vertical and horizon- 
tal tangents. Initial values are: (x = 0.2,y = 2) and 
(x = 2.5,y = 2). 


dy X(mX +b) —(mX +b)? 
dX ——~—S«X —X(mX +b) 
mX* + bX —m’X? —2bmXx — b* 
X —mX?— bX 


= mX — m?X? — bmX — mX? — bX + m?X? + 2bmX 
X?(—m? —m+m?) + X(m—bm—b+2bm) +b? =0 


> [b=0), [m=0}, 


Task g) Sketch the phase portrait. 
We try to help our drawing with additional point tests: 


Test at (X =2,Y =1): 


The phase plot can be seen in Figure 44 and the corre- 
sponding MATLAB code is shown below: 
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function PhasePlot2 


& Title: 

% Author: 
%& Licence: 
% Last change: 


phase plot 

Marco Albrecht 
EUPL v1.2 

01.09.2018 


= 1;% 
% Trajectories 
rajectory 1 
span=0:0.01:150;%[t_start t_end] 
c0=[0.2; 2];%[initial conditions] 
a,cl] = odel5s(@system,tspan,c0,[],p); 
S$Trajectory 2 
tspan=0:0.01:150;%[t_start t_end] 
c0=[2.5;2]; [initial conditions] 
4,c2] = odel5s(@system,tspan,c0,[],p); 


tH 


S% Plot 

Figure (1) 

set (gcf,'color', 

hold on 

% trajectories 

plot (d1:(2,1):. cL 2,2); “Calor, [0.4 

0.75 0.75], 'Linewidth',2) 

plot (e2(:,,.1). 02: (2,2), ‘Color’, 

0.6 0.85], 'Linewidth',2) 

% Asymptotes 

ploe( (0 2.5], 10 2.5], "k==") 

plot([0 2.5],[1 1],'k--') 

% steady states 

plot (0,0, 'ro', 'MarkerSize',3, 'LineWidth',5) 

plot(1,1,'ro', 'MarkerSize',3, 'LineWidth',5) 

% arrows 

annotation('arrow', [0.54/2.5 
0:..52/2.5)], [1.8/2.5 


Tw); 


[0.85 


1.6/2.5), “e6lee", [O24 0875 
0.75], 'Linewidth', 2) 
annotation('arrow', [1.2/2.5 
1.3/2 .5)|7 [068/225 a4 
0.87/2.5],'Color', [0.4 0.75 
0.75], 'Linewidth', 2) 
annotation ("arrow', [1.5/2.5 
LB 2-5 Nip: [LO 2 2 5 sises 
1.78/2.5]; "Color", [0:85 0:6 
0.85], 'Linewidth',2) 
% etc 
ylabel('Y"') 
xlabel ('X"') 
end 
function dcdt=system(-7,c,p) 
6% Load parameters 
% p.a=1 
& p.b=1 
6% System dxdt=A*x 
dedt (1) = p-axc(1)*(1-c(2)) 76 X 
dedt (2) = p.b«*c(2)*(c(1)-c(2)) io y 
dedt = dcedt'; & important! - transposes 


the solution vector 
end 


Notes 


Notes 


Chapter 4: Physical modeling and non-linear enzyme 
kinetics 


Thomas Sauter, Marco Albrecht 


Motivation 

Reality can be difficult to grasp. Philosophers puzzle their heads over the true nature of reality and our image of 
it. Accordingly, modeling is the art of coming close enough to reality to give valuable insights, without actually 
being reality. Sometimes we have to include more details. The reaction rates v, with which you are already 
familiar, can be replaced by more complicated non-linear enzyme kinetics under certain circumstances. Moreover, 
mathematical equations might be too flexible initially because they are entirely virtual. We have to actively 
include physical constraints to let the model behave more like reality and to force the model behavior into realistic 
paths. Different modeling techniques can require the application of different physical laws. But instead of making 
our models more and more complicated, we have to ask ourselves: what is essential and thus has to be included 


in a model and what only distracts us from the real kernel? But what is the real kernel? Which criterion can help 
us to choose an appropriate approximation of our problem? We can neither ignore reality nor can we ever fully 
represent it. Join us in finding a healthy distance from reality and in exploring the fascinating and comprehensive 
field of enzyme kinetics. Modeling is a wonderful tool for thinking deeply about the origin of all these superficial 
observations which we encounter in private life and our professional existence. Let’s make this tool sharp! 


Keywords 
Model classification — Model building — Assumptions — SI units — Akaike information criterion — Balance 
equation — Enzyme kinetics 


Contact: thomas.sauter@uni.lu. Licence: CC BY-NC 


Contents already, the graph alone can give us insight. However, 
we have not modeled physical systems. In order to model 
1 Lecture summary 4 physical systems, we need to think about physical laws 
1.1 MiWhatis.a model? .......4..4<. , in thermodynamics, include physical relevant param- 
1.2 Ml Modeling procedure ........... 2 eters, think about reaction laws, and get familiar with 
1.3. Ml Mathematical analysis of models... .. 3 unit calculus. However, we begin with the question of 
1 A> MModeal clascificalion. 4.25 ccs oe 5 what distinguishes reality from its approximation. 
15 “Wi Alsibe Internation Catevioti 6 osc 5 7 If you prefer, you could also jump directly to balancing 
1.6 MM Extensive properties & SI units... . . 7 (page 10)—an approach which enables us to formulate 
17 MBalancing .............000. 19 mathematical models of physical and biological systems. 
1.8 WEnzymekinetics ............. 12 
1.1 HB What is a model? 
2 Basics of Mathematics 19 "Un cercle n’est pas absurde,|..] Mais aussi un cercle 
n’existe pas" / "A circle is not absurd; it is clearly ex- 
References 20 plained by the rotation of a straight segment around one 
of its extremities. But neither does a circle exist." This 
3 Exercises 22 citation is from the work La Nausée written by Jean-Paul 


Sartre! [1,2]. Sartre was desperate because whenever he 
had a word for or a model of something, it would never 
be the reality he wanted to describe. Because we are 
unable to name something appropriately, the real things 
1. Lecture summary remain nameless. The difference between the beauty of 
abstract description and the imperfection of real-world 


4 Solutions 
Do not betray yourself! 26 


We learned in the previous chapters how to set up system 
equations and understand their dynamics. This is espe- 1 French philosopher, novelist, political activist, playwright, literary 
cially helpful for Metabolic Network Analysis. There, critic, and biographer Jean-Paul Charles Aymard Sartre (1905—1980). 


© 2023 Thomas Sauter and Marco Albrecht, CC BY-NC 4.0 https: //doi.org/10.11647/OBP.0291.04 
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phenomena nauseated him because it revealed to him the 
uselessness of existence. In the world of mathematics, ev- 
erything seems to be perfect and pure—like a circle. But 
you will never find a perfect circle in reality. Thus, the va- 
riety of human existence might suggest that something 
perfect—or a higher power which leads us—cannot exist. 
Instead, it is up to us to turn our existence into essence. 
Independent of our opinion of the European movement 
of existentialism, this movement urges us to get a bal- 
anced feeling for what models are. We have to bring 
them close enough to reality to be useful, but one has to 
stop the refinement if this stops helping us answering 
real-world questions. George Box” once wrote: "Remem- 
ber that all models are wrong; the practical question 
is how wrong do they have to be to not be useful" [3]. 
This pragmatic view helps us navigate the process of 
understanding reality with useful models, but it spares 
us depression caused by exaggerated perfectionism. 

Mechanistic models have the following advantages [3]: 


e They contribute to our scientific understanding of 
the phenomena under study. 


e They usually provide a better basis for extrapola- 
tion (at least to conditions worthy of further ex- 
perimental investigation, if not through the entire 
range of all input variables). 


e They tend to be parsimonious (i.e. frugal) in the 
use of parameters and provide better estimates of 
the response. 


We want to address what the reality is in our specific case. 
Biologists know that experimental models in biology 
should resemble—but usually do not fully represent— 
real-world phenomena in vivo. However, the biologist 
does not really need to think about characteristics he or 
she does not care about. We biologists have enough trou- 
ble ensuring that one’s experimental setting allows one’s 
living system to operate in the desired states and that 
the observations (measurements) indeed show what the 
real system states look like in principle. It is also the 
pragmatic realization that it is just impossible to mea- 
sure all possible players at any time under any condition. 
But, many things in biologists’ system of interest are al- 
ready true because they work on a real system. If they 
model something, they work on an entirely virtual sys- 
tem which they think represents what they care about. 
This does not mean it is true. It is an abstraction. One 
has to actively ensure that one’s mathematical model 
is physically correct, to figure out which elements one 
can neglect and which not, and to iteratively compare 
one’s simulation result with experimental data. Which 
constraints one must take into account depends on one’s 
chosen modeling framework, its inherent mathematical 


? British statistician George Edward Pelham Box (1919—2013). 
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characteristics, and the underlying assumptions. Con- 
sequentially, the interpretation of the model’s result de- 
pends on one’s modeling strategy and the considered 
assumptions, and is restricted to the considered concep- 
tualization. This applies to the modeling framework 
itself and to the content of the model. Modeling helps us 
to dive into a deeper understanding of what we observe 
and to find out what does not work or maybe why our 
conception of reality fails. Modeling is not a tool to con- 
firm our premature beliefs about what is helpful in order 
to seem successful. The less we know, the more careful 
we should be with interpretation. Modeling is a tool 
to helps us understand biology, but it does not replace 
experiments. Bottom-up modeling helps us to develop 
a consistent theory on what we observe and top-down 
modeling help us to extract the most likely relationships, 
but modeling cannot magically generate new data from 
nothing. Never fall in love with your model and always 
be sceptical and aware of its shortages. Moreover, it is 
not an excuse to oversimplify a model until it shows the 
obvious and thus becomes useless. Useful modeling is 
the art of representing reality to a degree that goes be- 
yond what you can understand on your own and find 
the limits of what the mathematical framework can han- 
dle with given data. Therefore, you need to master the 
mathematical framework, its physical interpretation, and 
the biology of what your model represents. You need. 
the knowledge on how experimental data has been de- 
rived, how data can be interpreted, and whether you can 
use this data in the context of your model. The general 
modeling approach is part of the next section. 


1.2 HM Modeling procedure 
A mathematical model uses mathematical descriptions 
and computer simulations. A model can be defined as: 


Definition 1. Model: A model is the representation of 
the essential aspects of a system which represents knowl- 
edge of that system in usable form (Eykhoff, 1974) [4]. 


The modeling procedure is an iterative approach and fol- 
lows a cycle such as that shown in Figure 1 and usually 
requires an iterative procedure of computational and ex- 
perimental analysis (Figure 2). The next section is based 
on a textbook in German language [5]. 


Specify the question of interest 

The question of interest is of utmost importance. It will 
hardly ever be the case that you have a universal model 
with which you can explain all kind of questions. With 
the main question in mind, you can start with the sim- 
plest model that can roughly represent your system. As 
soon as it works, you can refine it iteratively. Modeling 
scopes can be: 


e Scientific understanding 
e Validation or falsification 
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3 


real a = mathematical 
“f model 


2 
real oe 1 > situation 4 
situation model 
6 
real mathematical 
results results 
rest of : 
the world mathematics 


Figure 1. Modeling cycle of Blum and Leif. 1: under- 
standing 2: simplifying/structuring 3: mathematizing 
4; working mathematically 5: interpreting 6: validating 
[6]. Copyright © 2007 Authors (Blum, Leif), Fair Use. 


Prediction 

Decision-making, strategy planning 
Design of a controller 

Virtual prototype or material simulation 
Pattern recognition. 


The modeling scope determines the level of detail of 
the model. An experienced modeler can plan the mod- 
eling strategy with the mathematical implementation 
and numerical issues in mind. Numerical analysis° isa 
mathematical discipline which tends to solve problems 
approximately. Thereby, errors can occur which have to 
be minimized. Not all numerical methods can be applied 
to any problem, and sometimes the solving process can 
go completely wrong. 


Assumptions 

Assumptions about reality help us to make the real- 
world problem abstract and solvable. There are, for 
example, assumptions for simplifying the description 
of inter-dependencies or assumptions for constraining 
the number of states to investigate. Assumptions should 
not contain the obvious, should enable a useful model, 
and should not lead to infeasible models. Assumptions 
build the foundation of the model and determine the 
interpretation of the outcome. Each assumption has to 
be justified and checked for the qualitative and quan- 
titative impact on the model. The assumptions should 
not contradict one another. Assumptions might change 
during the modeling process. The following questions 
help us to identify appropriate assumptions: 


e Which effects can be neglected? 

e Which scenario might be a good starting case? 

e Which parts of a system are necessary and which 
are not relevant? 

e Cana part of the system be prescribed? 


3 Earliest mathematical writings from Babylon 1800—1600 BC 
(Iraq). 


Model building 

The modeling procedure starts with a verbal model 
which is the initial model concept. It helps us to de- 
termine what belongs to the model and what does not, 
and is specified by the expert of the system. Defining 
the system's purpose is a part of this process. The verbal 
model is written in the everyday language of the mod- 
eler and the experts. It is also the starting point for the 
influential structure of a given system. The following 
steps are necessary to build a model in order to answer 
the question of interest: 


e Specify model and system parameters 

e Specify model variables and system states 

e Specify the interdependencies between the states 
with auxiliary conditions and physical laws 

e Formulate the mathematical procedures, such as 
optimization, sensitivity analysis etc. 


The system variables represent the states which change 
during the simulation. The system variables span the 
phase space and should fulfil the following criteria: 


e Independence: No system variable can be described 
as a function of another. 


e Completeness: The state of the system is fully ex- 
plained by all state variables. 


The auxiliary conditions describe the interdependencies 
between the system variables and are adjusted by the 
system parameter. Modelers use concept maps to draw 
potential cause-effect relationships and to reason about 
what has to be modeled and how. Engineers are, like 
biologists, very visual scientists and need drawings for 
fruitful communication. 

A system parameter is a number that does not change 
with time. In the previous chapters, we mainly talked 
about rates. These rate terms represent equations by 
themselves and include parameters. Only in the most 
simple case are rates represented by a single parameter. 
A typical procedure in modeling is the search for param- 
eters which are able to let the model behave comparably 
to reality. 

A good procedure to get good models is to ensure that 
one has as many equations as variables (remember the 
Rouché-Capelli theorem). 


1.3 Ml Mathematical analysis of models 

Mathematical methods can be used to make models fea- 
sible, to solve them and to understand their properties. 
This section is based on the same textbook in German 
language [5]. 


Dimension analysis 

The model variables might have physical meaning and 
units. A reformulation can be helpful to get rid of the 
units with a proper unit calculus. This process is called 
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Figure 2. The principle of iteration in modeling. We usually start with a hypothesis and choose an appropriate 
modeling strategy (graph, discrete, continuous, static, stochastic, deterministic). We integrate data and adjust the 
model until it fits. Then we perform a prediction and see whether the model is qualitatively and quantitatively good 
enough. If not, the cycle is repeated until we sufficiently understand the system. More information and source: [7]. 


Copyright © 2009, Nature Publishing Group. 


non-dimensionalization. Moreover, relevant lengths or 
magnitude ranges of the variable values might be de- 
fined to determine what relevant influences are. Ques- 
tions of scale are relevant. Multi-scale models can con- 
tain different relevant magnitude orders of interest. 


Inverse and poorly posed problems 

One might perform certain steps to check whether the 
model is mathematically well-posed in itself. If not, 
mathematical procedures might perform poorly. A well- 
posed problem has a unique solution. This solution 
changes continuously with continuously changing pa- 
rameters. A system matrix which is not invertible can 
cause numerical problems. Matrices with a high condi- 
tion number are ill-conditioned. A condition number 
measures how much an output value changes with small 
changes of the input number. 


Investigate special problems 
One might first solve special cases before investigating 
the more complicated general case. 


Simplification and linearization 

One might neglect terms with minor impact to see if the 
simplified model behaves similarly to the original model. 
The neglected terms are called small disturbances of the 


simplified model (perturbation theory). Karl Popper * 
once said: "Science may be described as the art of sys- 
tematic over-simplification—the art of discerning what 
we may with advantage omit." 

Linearization around specific points might help inves- 
tigate the model’s behaviour and acquire information 
about the non-linear model at specific points. 


Check qualitative properties 

One runs the model and gets conclusions on asymptotic 
behavior, system stability, and if oscillations can occur, 
among other qualitative properties. 


Sensitivity analysis 

After one has found a solution with a given set of opti- 
mal parameters, one wants to know how small changes 
of parameter values influence the solution. This is an 
important type of analysis to carry out. 


Interpretation and validation 

The interpretation of the results might happen in light 
of previous expectations and requires the consideration 
of all assumptions. The simulation might give us a pre- 
diction which is then subjected to a validation process. 
If experimental data can be generated, one has to deter- 
mine how tolerant one is ready to be. How well does 


* Austrian and British philosopher Sir Karl Raimund Popper (1902— 
1994). 
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the model fit the new data? Sometimes no data can be 
generated. In this case, check the plausibility with the 
experts of the given system. Surprising simulation re- 
sults might lead to a deeper investigation of subsystems 
or checking whether parameters and variables remained 
in physically meaningful ranges after the previous opti- 
mization. One cannot prove the ‘correctness’ of a model, 
and a model cannot be ‘verified’. Correct settings in a 
simulation cannot constitute proof of accurate behavior 
under all possible circumstances. The only thing it can 
do is falsify the theories and models, e.g. if observations 
and simulations disagree. Consequently, we always say 
that we validate a model in light of the model’s purpose 
and in the following aspects: 


Structural validity: The model has an influence struc- 
ture which resembles the influence structure of 
the original. The number of essential states and 
the feedback structures have counterparts in the 
original system. 

Behavioral validity: The model shows qualitatively the 
same dynamic behavior, e.g., oscillations or not. 

Empirical validity: Within the model’s constraints, the 
simulation result follows experimental data and, 
if no observable data is available, the results are at 
least consistent and plausible, and the parameter 
ranges are empirically justified. 

Application validity: The model fulfils its purpose and 
can be applied to make decisions, to control pro- 
cesses etc. 


1.4 MH Model classification 

Usually you do not have to reinvent a modeling strategy. 
Often you can use previous procedures and apply these 
to your problem of interest. How often which method 
type is used can be seen in Figure 3. The spectrum of 
some methods in systems biology can be seen in Figure 
4, Mathematical approaches are listed in Figure 16 in 
the section on mathematical basics. Here, we give an 
overview on model types. This section is based on the 
same textbook [5]. 


Mathematical structure 
We can distinguish the following contrasting pairs: 


Static or dynamic models: Dynamic models change over 
time. Static models are time-independent or are in the 
steady state of a dynamic model. The time can be con- 
tinuous or discrete. 


Discrete or continuous models: If the variable values can 
be counted, one is dealing with a discrete model. See 
Figure 5 for an illustration of continuous and discrete sig- 
nals in time and state. Each class requires fundamentally 
different modeling strategies. Discrete models require 
methods of graph theory and the theory of finite-state 
machines. Continuous models require higher mathe- 
matical models of calculus. However, it is possible to 


discretize continuous models to obtain and handle a 
simpler and reduced discrete model in the hope it suffi- 
ciently resembles the original continuous model. 


Deterministic or stochastic models: A deterministic model 
always comes to the same solution with given parame- 
ters and initial variable values. Stochastic models oper- 
ate with random events and are used for models with 
certain likelihoods. Stochastic models are also used for 
very complex models where the structure is not fully 
understood. 


Sometimes we have models with mixed elements; how- 
ever, it helps to understand the character of each in or- 
der to make wise choices and to be aware of the conse- 
quences. 


Level of description 

1st-principle models, white-box models Models can be 
assembled from known laws or well-understood 1st prin- 
ciples. 


Heuristic models, gray-box models Models based on more 
or less justified assumptions for the cause and effect 
inter-relationships. Often for complicated systems with- 
out stringent rules, such as ecologic systems and the 
Predator-Prey Model. 


Descriptive models, black-box models Models generated 
on the basis of data without system knowledge. Typically 
input-output relationships, as illustrated in Figure 6. 


: black box y 


Figure 6. The black-box concept. We know little about 
the system, but we can learn from the input u - output y 
behavior. 
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Figure 4. Spectrum of modeling methods: Reproduced picture from source: [9]. Licence: CC BY-SA 4.0 
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Figure 5. From continuous to discrete time intervals and 
discrete states. 
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1.5 Mf Akaike Information Criterion 

"Whenever a theory appears to you as the only possible 
one, take this as a sign that you have neither understood 
the theory nor the problem which it was intended to 
solve," said Karl Popper. Therefore, we usually have 
several candidate models and ask ourselves which model 
might be the best. A scheme is shown in Figure 7. 
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° 
a 


Hypothesis 
generation 


Species X 
7 


Mg 


Species X 
° 


2 4 6 
t 


7 Model complexity 


Model ranking 
f (M2) > f (Mg) > f (M4) 


Model selection 


M2 


Figure 7. Model selection: We generate different hypothe- 
ses and select the best-balanced model. Model Mj is too 
simple to mimic reality (not enough parameters). Model 
M3 fits the noise rather than the underlying system and 
is thus not robust (too many parameters, overfitting). 
Model M2 represents a good fit. Source: [10]. Copyright 
© 2013, Elsevier Ltd. 


A commonly used decision tool for the model selec- 
tion is the Akaike Information Criterion (AIC), amongst 
other similar criteria. The AIC takes into consideration 
the goodness of fit L and the number of parameters k: 


AIC; = 2k—21n(L). 


The model with the minimal AIC should be preferred. 
This criterion says nothing about whether a model is 
good or bad. It simply says that a model is capable 
of generating behavior in line with data. The relative 
likelihood of a subordinate model i to the minimal model 
AIC; can be calculated with: 


exp((AICmin — AIC;) /2) 


For small models, the AIC might point to models with 
too many parameters. The model is overfitting then. It 
means that you fit noise instead of the model dynamic, 
which increases the likelihood of giving false predictions. 


With the following correction: 


2k? + 2k 


AiGc= Aes 
. 7 a 


you increase the impact of the parameters. A large sam- 
ple size n can reduce the impact of this correction factor. 
For those of you who want to learn more, there is also the 
Bayesian Information Criterion (BIC), the Takeu-chi’s 
Information Criterion (TIC), and the Widely Applicable 
Information Criterion (WAIC) [10]. The model selec- 
tion can thereby depend on the experimental design [11]. 
Minimizing the AIC is effectively equivalent to maximiz- 
ing entropy in a thermodynamic system according to the 
second law of thermodynamics. 


1.6 Mf Extensive properties & SI units 

Besides continuous macro-scale and stochastic micro- 
scale thermodynamics, we have to differentiate between 
extensive quantities” and intensive quantities, if it comes 
to transporting from one system to another. Intensive 
quantities such as temperature, concentration, pressure, 
and density are independent of the system size or amount 
of material. Thus they cannot be physically transported 
through a system boundary. Extensive quantities, how- 
ever, are defined quantities whose magnitude is additive 
for subsystems, according to the International Union 
of Pure and Applied Chemistry IUPEC. Around 30 ex- 
tensive quantities are listed in Table 1 and are grouped 
according to their relationship to mass. 


The old SI units (International System of Units) in Figure 
8 are currently under reformulation. We mention the 
newest proposals: 


Definition 2. Second: The second, symbol s, is the SI 
unit of time. It is defined by taking the fixed numerical 
value of the caesium frequency AvCs, the unperturbed 
ground-state hyperfine transition frequency of the cae- 
sium 133 atom, to be 9192631770 when expressed in the 
unit Hz°, which is equal to s~ , 


Definition 3. Meter: The meter, symbol m, is the SI unit 
of length. It is defined by taking the fixed numerical 
value of the speed of light in vacuum c to be 299792458 
when expressed in the unit m- s_!, where the second is 
defined in terms of the caesium frequency AvCs. 


Definition 4. Kilogram: The kilogram, symbol kg, is the 
SI unit of mass. It is defined by taking the fixed numerical 
value of the Planck’ constant h to be 6.62607015 - 10~*4 


5 Most textbooks use the term "extensive properties"; however, this 
term is impure in this context because properties like color are not 
quantitative, and some extensive quantities such as heat and work are 
not properties [12]. 

6 German physicist Heinrich Rudolf Hertz (1857—1894). 

7 German theoretical physicist Max Karl Ernst Ludwig Planck 
(1858—1947). 
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Table 1. Extensive properties: This table gives you a feeling for different extensive properties. You do not need 
to learn this by heart—you simply need to understand the principle. Z: extensive property. m: mass. z: intensive 


property. 
Symbol Quantity Unit Conditions Formula Notes 
(Z/m = ) 

Zl proportional to mass 
m mass kg none® m/m = | 
ge weight Jm7! gconstant F,/m=g g=98lms? 
Z2 proportional to mass under constant composition (Z/m = k), else Z2 = Z3,° 
C3 heat capacity Jk"! n,constant C,/m =c Cp = specif c heat cap. 
n amount of substance mol n; constant n/m = 1/M M = molar mass 
Vs volume (solid, liquid) m? nj constant Vs,/m =p p = density 
N number of particles = n; constant 
Z3 mass is constant of proportionality, or conditional proportional (Z/m = z) 
Ex kinetic energy J E,/m = iv v = velocity 
BE, gravitational J Eg/m = gh h = height, 

energy g =9.81m/s" 
p momentum kg-ms p/m=Vv v = velocity 
F> force Im! F/m=a a = accelaration 
U internal energy J U/m = u(T) T = temperature 
G free enthalpy J G/m = g(Tp) g = specif c free 

enthalpy 

H enthalpy J H/m =h h = specif c enthalpy 
S entropy JK"! S/m = s s = specif c entropy 
VG volume (gas) m? V<o/m = M.v<(T,p) Vg = molar volume 
Vv volume m? V = Vor + VG 

(all phases) 
ZA extensive quantities independent of mass 
A area (interface) m? — 
E. electric energy J — 
Ey, spring energy J Ey = tk.x? k = spring constant 
Br displacement energy J Ep = p.V p = ambient pressure 
Esie surface energy J Eyur = Y.A Y = surface tension 
i current (electric-) Cy — 
P power (most forms) Js! = = 
Q heat J za = 
q charge (electric) C = 
WwW work J - = 
€ extent of reaction eq! = conjugate to aff nity 
d&/dt reaction rate eqs/4 - 

f ow (f uid) kg s7! — 

total quantities = 29 total 


when expressed in the unit J-s, which is equal to kg - 


2 


m?-s~!, where the meter and the second are defined in 
terms of c and AvCs. 


Definition 5. Ampere: The ampere,® symbol A, is the 
SI unit of electric current. It is defined by taking the 


8 French physicist and mathematician André-Marie Ampére (1775— 


fixed numerical value of the elementary charge e to be 
1.602176634- 10~!9 when expressed in the unit C, which 
is equal to A-s, where the second is defined in terms of 


AvCs. 


1836). 
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Old SI 


Figure 8. Old SI units: s: second, kg: kilogram, mol: 
mole, cd: candela, K: kelvin, A: ampere. m: meter. 
Source: Wikipedia, Emilio Pisanty, 2016, Licence: CC 
BY-SA 4.0. 


Definition 6. Mole: The mole, symbol mol, is the SI 
unit of amount of substance. One mole contains exactly 
6.02214076 - 1073 elementary entities. This number is the 
fixed numerical value of the Avogadro constant,’ Na, 
when expressed in mol~', and is called the Avogadro 
number (IUPEC, 2018, [13]). 


Definition 7. Amount of substance: The amount of 
substance, symbol n, of a system is a measure of the 
number of specified elementary entities. An elementary 
entity may be an atom, a molecule, an ion, an electron, or 
any other particle or specified group of particles IUPEC, 
2018, [13]). 


Definition 8. Dalton: The 1971 definition of the mole 
implies that the Avogadro number equals the ratio of the 
gram to the Dalton!” (mu = 1 u = 1 Da), with the value of 
the Dalton (Da) expressed in grams. The historical conti- 
nuity of the present definition preserves this relation, not 
exactly, but to within an uncertainty 10~!° negligible for 
practical purposes (IUPEC, 2018, [13]). The relationship 
of Dalton to the Avogadro constant might be redefined 
to a scaling factor Na = ¢-mol!. 

Definition 9. Kelvin: The kelvin,'! symbol K, is the 
SI unit of thermodynamic temperature. It is defined 
by taking the fixed numerical value of the Boltzmann’? 
constant K to be 1.380649 - 10-73 when expressed in the 


* Italian scientist Amedeo Carlo Avogadro (1776—1856). 
10 English chemist, physicist, and meteorologist John Dalton (1766— 
1844). 
11 Scots-Irish mathematical physicist and engineer William Thomson 
Kelvin (1824—1907). 
? Austrian physicist and philosopher Ludwig Eduard Boltzmann 
(1844—1906). 


New SI 


Figure 9. Proposed SI units: s: second, kg: kilogram, 
mol: mole, cd: candela, K: kelvin, A: ampere. m: meter. 
Source: Wikipedia, Emilio Pisanty, 2016, Licence: CC 
BY-SA 4.0. 


unit J-K~!, which is equal to kg-m2- s-2-K—!, where the 
kilogram, meter, and second are defined in terms of h, c 
and AvCs. 


Definition 10. Candela: The candela,'’, symbol cd, is 
the SI unit of luminous intensity in a given direction. 
It is defined by taking the fixed numerical value of the 
luminous efficacy of monochromatic radiation of fre- 
quency 540- 10!? Hz, Kea, to be 683 when expressed in 
the unit Im- W~!, which is equal to cd-sr-W7!, or 
cd- sr-kg~!-m~?-s3, where the kilogram, meter, and 
second are defined in terms of h, c and AvCs. 


With this new SI set, the relationships between the SI 
units might change according to the scheme in Figure 9. 


13 From candle. 
14 sr: The steradian or square radian is the SI unit of solid angle. 
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1.7 & Balancing 

After this detailed introduction to modeling concepts 
and other relevant background information, we are now 
coming back to the question on how to formulate mathe- 
matical models of biological systems. We will introduce 
the approach of balancing here, which is an extension of 
the ODE equations derived for biochemical reactions in 
the previous chapters. The general concept of balancing 
for describing system behavior over time is: 


temporal change | _ : 

of state variables | — +[inflow } ‘outflow } +(-source } 
-| sink 
Inflows and outflows are exchanges across the system 
boundary. In contrast, sources and sinks are system 
internal processes. 


Continuous changes are described with differential equa- 
tions. 


dd _ 


—=@= J+P 
dt . 


SY 
Rates of change 


In a (time) discrete form this would be: 


A@=O(t+1)—O(t) = J+P 
Changes 


=> @P(t+1)=@(t)+J+P 


Example 1: Population balance 


A continuous population balance with population 
size N |—]. 


- immigration emigration a“ birth 

= rate ~ rate rate 
death 
rate 
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y m3 kg 
qout — * Pout Fe 


Figure 10. Mass balance of a liquid container. 


Example 2: Mass balance 


The mass balance of a liquid container is as shown 
in Figure 10: 


; kg 
M = Qin’ Pin — Yex* Pex re 


Example 3: Volume balance 


The volume balance can be derived from the mass 
balance: 


m=(p-V)=p-V+p-V=p-V 


with the product rule, and with the assumption 
that the density is constant and does not change 
p =const. 


P=v 


Example 4: Amount of substance balance 


The amount of substance balance is a type of mass 
balance which is especially suited for chemical reac- 
tions. It is also known as mole balance. The change 
of the amount of substance is i with the unit [™°] 
The rate constant is k and can have different units. 


Let us look at the following reaction: 


A+B—+P 
with the balance equation: 


—na = —Ng =np =k-Ca-Cp-V 


The unit calculus is 


Be ee 
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The missing unit X for the rate constant k is there- 


fore [1]. Remember: ng = C4-V 


=> NA = (CaV) =CyV+C4V =CaV 


with the product rule and constant volume V = 
const: 


V=0 
We moreover obtain the following relationships: 
=> —ny = —Cy-V=k-Cy-Cg-V 


=> Ca = —k-Cy-Cp 


We might also point to the rate of mole change: 


Some useful equations 
The molar mass is: 


mM _ Pi 
M; = — = — Ae m; = M;-n; 
nj Ci 


The mass flow 1; [| is: 


m= q: Pi 
with volume flow rate q =| and mass density p [3s] ‘ 


The molar flow 7; [mot] is: 


Ny = 4° Cj 


with volume flow rate q [=| and molar concentration 


Cj [2] . The conversion is: 


mol _3mol 3 
m L 


with the unit molar M and millimolar mM. 


Reactions 
We repeat some reactions and their respective balance 
equations: 


ki dna __ dnp dna __ 
A P dt — dt dt k-ca-V 


k 
A==P ata — _ky-cq-V+k_1-cp-V 


A+B 4P 


k-4 


AR spies 


-1 


Wa — —ky-ca-cp-V+k-1-cp-V 


d 
a = —k,-ca-cp-V+k_,-cp-cs:V 
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1.8 M Enzyme kinetics 

Enzymes are very important proteins and good drug 
targets. For example, the BRAF kinase’? can be inhib- 
ited by the drug dabrafenib. Dabrafenib in turn can be 
degraded by another enzyme class named cytochrome 
P450. The CYP450 enzyme class accounts for 75% of 
drug metabolism and five of the 57 human CYPP450s 
are involved in 95% of these reactions [14]. Enzymes 
are sometimes very unspecific and share similar targets. 
Thus when an enzyme is inhibited, this can result in 
undesired off-target effects also on other enzymes. Have 
a look at the phylogenetic tree of the human kinome in 
Figure 11. 


. ag 


peg tee J 
. RNAseL pa = 
cuss SN = 
Gs, ° v ‘espn Saree 
7 " CK1 
SP "oe 


NEKiP 


‘Atypical Protein Kinases 


age |_—__-—@anen 


“Illustration reproduced courtesy of Cell Signaling Technology, Inc. (www.cellsignal.com)" 


Figure 11. Kinase tree: The maps shows the phylogenetic 
tree of the human kinome using the KinMap software. 
Sky-blue annotations are kinases which are melanoma- 
specific. Red dots indicate kinases for which drugs are 
available. Source: [15], Licence: CC BY-SA 4.0. 


Chemoinformatics can analyze the molecular struc- 
ture of both enzyme pockets and drugs together. 3D 
pockets of key targets and off-target kinases thereby al- 
low the design of more specific drugs [16]. This helps us 
to predict off-target effects, enzyme-drug affinities, and 
support the improvement of drug structures. We will 


15 Kinases catalyses the transfer of the phosphate group to specific 
substrates. The opponent is the phosphatase. 
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concentrate on the enzyme kinetic for biochemical net- 
works. In Systems Biology, we are often more interested 
in the interplay of an enzyme with other molecules and 
the role of an enzyme in a network. We might thereby 
ignore the chemical complexity as well as the structural 
geometry of the enzymes and ask e.g. only the following 
questions: 


e Which molecules can bind the enzyme? 

e Which molecules will be released? 

e Which molecule-enzyme affinities appear? 
e Is the binding reversible? 


e Does the molecule inhibit or activate the enzyme 
function? 


e Does the molecule bind the catalytic side or an 
allosteric side? 


e Is the enzyme specific? 


YouTube: Enzyme kinetics series (7 videos by Khan 
Academy) 


The reaction from substrate to product is usually pro- 
moted by enzymes. Enzymes split the reaction into less 
energy-intensive subreactions and thereby have a bind- 
ing phase and a catalytic phase, as shown in Figure 12. 
You learned in Chapter 2 that we can model biochemical 
networks with the stoichiometric matrix: 


S=N-v 


which we often scrutinize in the steady state (dynamic 
equilibrium) with: 


O=N-v 


The non-trivial solution gave us the fluxes and interest- 
ing properties such as dead ends, linear paths, and the 
kernel matrix K. We repeat the balancing of a biochemi- 
cal reaction in Example 5. In this section we will focus 
on the reaction vector v. 
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Figure 12. Energy profile during an enzymatic reaction: 
The bigger the energy difference, the lower the probabil- 
ity of transformation to the products. Consequently, the 
reaction rate is lower as well. Source: Thomas Shafee 
& Smokefoot, Wikimedia, Licence: CC BY 4.0 (upper 
figure) & CCO 1.0 Universal Public Domain Dedication 
(lower). 


Example 5: Balancing a biochemical reaction 


The substrates are catalyzed to form 2 product 
molecules: 


Vf 
S1 + Sz == 2P 
Vb 
The overall (net) reaction rate is calculated by the 


forward reaction rate vr minus the backward reac- 
tion rate v,. The ODE equation set is: 


S, = Sp = —ky-S,-So tk} a ca 
VF Vb 
P=2k,-8,-S)—2k_1-P* 


Lad Vb 


The reaction coefficients vr and v, are based on the 
Law of Mass Action formulated by Waage'® and Guld- 
berg!” in 1864 [17]. We say that the reaction rate is pro- 
portional to the probability of a collision of the reactants. 
The probability of a collision in turn is proportional to 


16 Norwegian chemist Peter Waage (1833—1900). 
17 Norwegian mathematician and chemist Cato Maximilian Guld- 
berg (1836—1902). 


Mj... 


the concentration of reactants to the power of molecu- 
larity (stoichiometric coefficient). Refer once again to 
Example 5. The net rate of the product is: 


V=ve—Vp = ky - Sy] -S2—k_] <P 
with the related unit calculus: 


es) -Lce 


— 
v ky Sy S2 k_y p2 


The unit of the kinetic parameters k;, in the reaction 
rate v, depends on the number of reaction partners. This 
unit changes from reaction to reaction. 

The general mass action rate law for a single reaction is: 


v=ve—vy =k [[s" —k_ []?” 
i Jj 


m;... molecularities of substrates 
molecularities of products 


Reactions in equilibrium + concentrations in equilibrium 
(Seq Ef Peg ) a 


Vf =Vb 
k pri 
Gj 
ky TL;Seg 


The free energy difference of a reaction: 
AG = —RT InKeg 


depends on the temperature T and the gas constant 
R=8.3 14—,. The free energy is indicated in Figure 12. 


The Law of Mass Action might result in a lot of equations, 
and simplifications are often used. These are based on 
certain assumptions, such as those illustrated in the fol- 
lowing for the kinetic rate law of, e.g., Michaelis-Menten. 
One standard textbook on enzyme kinetics is from Cornish- 
Bowden!® [18]. We will derive some of the kinetics and 
will provide, at the end a summary in Table 2. 


Michaelis-Menten kinetics 
The Michaelis-Menten kinetic [19] is based on the most 
common known enzyme reaction scheme: 
k 
E+S—:ES-",E+P 


k-4 
An enzyme E binds a substrate S to form an enzyme sub- 


strate product ES. This enzyme substrate complex can 
break down either back to the substrate or into the prod- 
uct. In both cases the enzyme is recovered, as expected 


18 British biochemist Athelstan John Cornish-Bowden (1943— 
today). 
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Vmax 


Almost all enzyme molecules occupied 


Competition for enzyme molecules 


Lots of enzyme molecules available 


Reaction rate v 


Km Substrate concentration S 


Figure 13. Michaelis-Menten kinetic. 


for a catalyst. The balance equations of the reaction net- 
work are: 


dS 

—=-—k,-E-S+k_,-ES 

dt 1 +k] 

dES 
Gh ES (k-1 +h2) ES 
dE 

= TMV E- St (K-1 +h) ES 
dP 

aE OP 

dt a ES 


The overall reaction rate is: 


_ dS_aP 
dt dt 


The substrate decrease is equal to the product increase. 
The enzyme is recovered. Because our equation set can- 
not be solved analytically, we will make some assump- 
tions: 


1.) Quasi-equilibrium assumption 

Michaelis!’ and Menten”’ stated that the forward and 
backward reactions between the substrate and the en- 
zyme substrate complex are much faster than the final 
reaction to the product: 


ky, k_1 > ky 


2.) Quasi-steady-state assumption 

Briggs”! and Haldane ~ stated that if and only if the 
substrate is in much higher abundance than the enzyme 
S(t =0) >> E, then the ES complex has constant concen- 
tration levels, or more generally: 


dES 
“a 


19 German biochemist, physical chemist, and physician Leonor 
Michaelis (1875—1949). 

20 Female Canadian physician and scientist Maud Leonora Menten 
(1879—1960). 

21 English scientist John Burdon Sanderson Haldane (1892—1964). 

22 English professor for Botany George Edward Briggs (18931985). 
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with which: 
dE 
“~=0 
dt 
is obtained. But if neither enzyme levels nor the levels of 


enzyme substrate change, there must be a total amount 
of enzyme which we can consider as a constant: 


Etotal =E+ES > E= Etotal —ES 


Putting everything together 
Reformulation of the differential equation of the enzyme 
substrate complex gives: 


dE 
“ are (425s 26 
ia (Bgst ES") S44 ES 0 


ki - Etotal °S —k1 -ES*-S — (k-1 +kn)-ES* =0 


and: 
ES* ki Etotal:S _ Etotal “5 


~ ky -S+(k-1 +k) 7 s+ a8 


We are interested in the generation rate of the product 
depending on the substrate availability: 


dP 
= — =k, -ES* 
. dt 


which is finally: 


= ko - EtotalS = Vmax *S 
Sarthe 4g Kn +S 
1 


k 


with the Michaelis-Menten parameter K,, and the max- 
imal catalysis velocity Vmax. The Michaelis-Menten pa- 
rameter indicates the substrate concentration with which 
one can obtain the half-maximal reaction velocity. You 
will need to be able to distinguish between the parameter 


types! 


Mechanistic parameters: k,,k_1,... (based on mecha- 
nisms) 

S 
Phenomenological parameters: Vmax, Km,... (based on 
systemic behavior) 


Lineweaver—Burk plot 


Lineweaver~’ and Burk‘ established a linear represen- 
tation of the function in Figure 14 to more easily access 
the parameters. This was of great importance before the 
appearance of appropriate computer programs. How- 


23 American physical chemist Hans Lineweaver (1907—2009). 
24 American biochemist Dean Burk (1904—1988). 
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Figure 14. Lineweaver-Burk plot. Source: Wikimedia, 
Licence: CC BY SA 3.0. 


ever, modern methods are available to experimentally 
determine more physical parameters without the quasi- 
steady-state assumption [20]. The research is still on- 
going (for example, taking substrate inhibition into ac- 
count [21]). Unfortunately, many publications use the 
kinetic laws although their underlying assumptions are 
violated—e.g. drugs might lead to enzyme induction 
of the CYP450 class leading to a changing total enzyme 
concentration Fiotal- 


Michaelis-Menten kinetic for reversible reaction 
The reaction scheme for the reversible Michalis-Menten 
kinetic is: 
1 ko 
E+S— ES ==E+P 
k_-1 k_2 


where the product concentration changes accordingly: 


dP 
= — =kES—k_»P-E. 
Vv fr 2ES 2 


The rate equation is: 


Vv =E total Sk 1, ko _ P 1g = 


k-1k-9 


Ee b 
Vmax S— Vmax P 
a Kns Kump 


1 S _P 
+ Kns ™ Kmp 


The phenomenological parameters are related in the fol- 
lowing way: 


a VinaxKme 
et ye aK 
max/*mS 


according to Haldene (1930). 


General Michaelis-Menten kinetic for inhibition 
We look at the general reaction scheme: 
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ky kg 
E+S Ez ES —> E+P 
+ a ae 
I I 
k_3{\k3 k_a\\kq 


ks ke 
EI+ 5 = ESI —~ E+P4¢4I 


We distinguish the following cases considering the re- 
lated reaction numbers 1-6: 


1,2 Michaelis-Menten 
1,2,3 Competitive inhibition 
1,2,4 Uncompetitive inhibition 
1-5 Noncompetitive inhibition 
1-6 Partial inhibition 


We can assume the following binding equilibria between 
the entities and the complexes: 


ki BS 
Kn = = aS 
me 
- k3 EI 
Qe 

k4 ESI 
kg abs EES 
2 ie BSI 


The respective reaction kinetics are listed in Table 2. 


Substrate inhibition 
The reaction network for substrate inhibition is: 


E+S = ES + E+P 
+ 
S 
AY 
ESS 
which leads to the kinetic: 
Vmax k_y ES-S 
=kES = ski = —_ = 
ne "ky ESS 


Kn +S (1 + £) 
with the optimal substrate concentration: 
S= V Kinky 


and the optimal velocity: 


Vmax 


Yopt = ——— =. 
TE Tay Ral Re 


An example for substrate inhibition of CYP1A2 (CYP450 
enzyme class) can be found e.g. here [23]. The CYP450 
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Table 2. Types of inhibition for irreversible and reversible Michaelis-Menten kinetics, so you can see that one needs different kinetic laws for different 
approaches. It is not necessary to learn this information by heart, but you might ponder on it for a while. Reproduced from source: [22]. Copyright © 2016, 
John Wiley and Sons. 


Equation - irreversible 


Name Implementation Equation - reversible Characteristics 
Pp q 
Competitive I binds only to free E; ae eg Hoge s yee AES Km changes, Vmax remains; 
ate P-release only from ES =e a as 5 ae S and I compete for the binding 
Piece kia = kes =k =0 Km-i3 +8 Ke wap 8 place; high S may out compete I 
" I binds only to ES; Geer 9 Km and Vmax change, but their 
Uncompetitive P-release only from ES = _VmaxS _ pS is Se Bae ratio remains; S may not out 
inhibition ba Shee 0 Kn+S-ig 1+ GS + cs) ig compete I 
Noncompetitive I binds to E and ES; Vines vhrax _ SV ax a Km remains, Vmax changes; 
inhibition P-release only from ES OS oe ee age vs 5 Pp). S may not out compete I 
Ki3 = Kya, ke =90 ores (145545) is 
Kn, and Vmax change; 
Maxed I binds to E and ES; S Kj,3 > Kji,4 : competitive- 
neuen P-release only from ES = gs sna noncompetitive inhibition; 
inhibition Kis 2 Rack =0 Kin ig +8 -i3 Ki3 < Ki4 : noncompetitive- 
. competitive inhibition 
Partial : re bind es : and io aes hex (1 ih Jy ) Ky, and Vmax change; if ke > ko, 
inhibition -release only from ES and ES v= ailies activation instead of inhibition. 
Ki3 # Kia, ke #0 Kn-i4 +8 +3 


le atg : _ k3 = a ee I SD I 
Abbreviations: Kj,3 = Bo M4= GB = 1+ Ky? 4= qh: Ka 
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enzyme class is unspecific and thus sensitive to substrate 
inhibition. Another consequence might be a drug-drug 
interaction where another drug is processed by the same 
enzyme as the main drug. This is why we talk about 
perpetrator and victim drugs—because of the influence 
of the pharmacokinetics over the other drug. 


Cooperative enzymes 
Regulatory enzymes often have more complicated ki- 
netics and their sub-units display cooperative behavior. 
Cooperativity is the property whereby an enzyme can 
have a steep dependence on the substrate or inhibitor 
[18]. The positive? homotropic”® cooperativity can be 
formulated on the following base for a dimeric enzyme 
Ep [22]: 
Ey + S—> ES 
slow 
EoS + S —> EpSo 
fast 


Because the second reaction is faster, one can assume 
that we have complete cooperativity. This means that 
the dimeric enzyme is either empty or full: 

Ey + 2S —> EySp 


The binding constant: 
- EoS2 
_ E- S2 


represents the equilibrium of concentration levels, and 
the fractional saturation of the enzyme is: 


y — 2E2S2 _ KpS* 
7 2F» total 7 1 + KpS? 


However, this Hill?” equation should be written with the 
parameter h (no integer) instead of n 


h<1 First ligand reduces affinity for the second ligand 
h=1 Quasi Michaelis-Menten 
h> 1 First ligand increases affinity for the second ligand 


It is incorrect to treat h as an estimate of the number of 
substrate-binding sites on the enzyme, though for some 
models it does provide a lower limit for this number [18]. 
Hemoglobin has 4 sub-units with a lower Hill coefficient, 
h= 2.7. Thus we reformulate our equation to: 


.. aes" 
1+ KpS' 
We assume that the fractional enzyme saturation scales 
with the substrate turnover: 
Ks" 


RT ae 


5 Ligand facilitates binding of the next ligand. > 1. 

26 Homotropic means that the allosteric modulator is the substrate. 
When the modulator is not the substrate it is called heterotropic. 

27 English physiologist Archibald Vivian Hill (1886—1977). Nobel 
Prize in medicine for his work on physical properties of muscles in 
1922. 


Table 3. Calmodulin model. Different approaches to 
model cooperativity with and without simplifying as- 
sumptions. Source: [24]. Copyright © 2014, Elsevier 
B.V. 


Model Actual number Effective number 
of parameters of parameters 
(generated by framework) (after assumptions) 

Induced fit (generic) 15 15 

Cooperative model 15 5 

(strong and weak sites) 

Sequential 15 4 

Allosteric 31 7 

Macroscopic 4 4 

Hill's equation 2 2 


The kinetic dependence on the Hill coefficient is visual- 
ized in Figure 15. 


0 0.5 1 1.5 2 2.5 3 
Substrat S 


Figure 15. Hill kinetic. Factural saturation Y with Kz = 
0.3 and different Hill coefficients. 


The Hill equation is a pseudo-mechanistic equation which 
Hill himself saw as purely empirical. However, the use 
of the cooperativity index R, of Taketa and Pogell is more 
empiric (experimentally meaningful) as it focusses on 
the 10-90% range of full activity. A mechanistic formu- 
lation is the Adair?> equation, which is most often used 
for positive cooperativity in practice. Moreover, we have 
the allosteric model or symmetric model*’°°?! with the 
assumption that the subunits of cooperative oligomeric 
proteins are independent of the ligand and all sub-units 
are in the same conformation at any time, while the se- 


28 British protein scientist Gilbert Smithson Adair (1896—1979). 

2° French biochemist Jacques Lucien Monod (1910—1976). Nobel 
Prize in Medicine for genetic control of enzyme and virus synthesis. 
Lac-operon-model. 

30 American biologist and biophysicist Jeffries Wyman (1901—1995). 

31 French neuroscientist Jean-Pierre Changeux (1936—today). 
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quential model** considers ligand-induced changes in 
a sub-unit with possible consequences to others. Thus, 
the sequential model assumes that sub-units show in- 
dependent conformations in contrast to the allosteric 
MWC*® model. Monomeric enzyme can show kinetic 
cooperativity if the different conformational states relax 
slowly. The section is based on the textbook found in 
[18]. Table 3 might give an impression of the model sizes 
for reflecting cooperativity. 


32 American biochemist Daniel Edward Koshland Jr. (1920—2007). 
Together with G. Némethy and D. Filmer in 1966. 
3 Author names: Monod-Wyman-Changeux. 
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2. Basics of Mathematics 


We have a collection of mathematical methods to help us tackle biological questions. They are listed in Figure 16. 
We recommend this review for further reading [25]. 


Testing for differences Significance Analysis Is there a difference between the two groups? 2 
(= 
“before/after” = = : I = 
95% Cl= X Et 5%) SE S 
“with/without” : (+ -n(5%)XSE) a — 
= 
28 y (<D) 
lvsi a Cc lati ivsi Does a small/large value of x, coincide with (ok 
Analysing covariaton orrelation Analysis Bemall/lamevateore s< 
cD) 
“coincides with” Rie Se S 
xy 2 2 
V De Ly (<9) 
Ww 
uw 
Identifying groups Cluster/Discriminant Analysis on ae re ” 
| 
“is similar to” Domina dij) g 
dz a TC 
x 
e . Explain thi ‘iability th h ighted li 
Condensing data Component/Factor Analysis “corhinstion atonal seh igs 
“reduces to” X=U=W" 
Fitting data Regression Analysis 
“relates to” Ve iGrases)| 
F ae ; : : hat i dicted value of y, 
Numerical predictions Time Series Analysis is hi air as a 
y 
“follows” WO)= f(yt-1),p(t-2),.--) ; © 
(e) 
Ti . 
is ; ime 5 
© 
Analyzing influences Bayesian (network) Analysis — From the observed evidence, H, is most probable. = 
Pr(£|H,)Pr(#H,) O ) c 
“Given E, the probability of His” P(H,|E)= <-->] (e) @ x< 
>YPr(E|4,)Pr(H,) O (i) oO 
6 
ae : hi be explained by a (bistabl 
Investigating mechanisms Dynamical Systems Theory ""° beer ana ace istable} al 
S 
“causally entails” d 5 
—x, = f(x,(t),....x,(t),u(t FA ae 
“Vf... Then ...” ae F(x @o--.¥4() é ~~~ < 


Stimulus 
Figure 16. Mathematical analysis approaches. More information and source: [25]. Copyright © Frontiers Physiol, 
Licence: CC BY-SA 4.0. 
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in in 
q C4 


in in 
q CR 


out 
q »Ca, CB 


3. Exercises 


@ Balance equations 

Set up balance equations for the volume and for the 
amount of substance (in mol) of every reaction partner 
for the systems depicted below. The reactions take place 
in an ideally mixed and closed reactor. 


& CSTR 

In the following, a Continuous Stirred Tank Reactor 
(CSTR) is investigated. A substance A with concentra- 
tion c’’ and temperature 7" is pumped into the reactor 
with the volume flow rate q'”. A 1st-order reaction takes 
place, during which Substance A is converted irreversibly 
into Substance B with a reaction rate constant k;. To in- 
crease the conversion rate, heat flow Onea is fed into the 
CSTR. The change in reaction enthalpy is neglectable. 
The volume flow rate g°” (temperature T°”; concentra- 


tions c{“ ce") is removed from the CSTR. 


in gin in 
qe", T™, C4 


a) Set up the balance equations for the amounts of 
Substance A and Substance B. 


out 
q »Ca, CB, Cc 
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out 
q » Ca, CB, Cc 


For the questions below, the following simplifying as- 
sumptions shall apply: g = q!" = q*" and V =const. 


b) Which simplified amount of substance balances 
can you achieve? Which variable are you balancing 
now? 


c) Which concentrations c4,cz do you obtain in steady 
state? 


@ Balancing a CSTR 

Cell-free medium (substrate S, influx concentration ee, 
volume flow rate q) is added to a Continuous Stirred 
Tank Reactor (CSTR) in continuous cultivation mode. 
The reaction volume within the reactor (Vp) is kept con- 
stant. Micro-organisms are growing on the substrate 
within the CSTR. The specific growth rate ps of the micro- 
organisms is approximated with a Michaelis-Menten 
kinetic. The yield coefficient shall be Yys. 


a) Set up mass balance equations for the concentra- 
tions of the biomass (X) and the substrate (S). 


b) Derive the steady-state concentration values (c§, cx) 
as a function of the dilution rate D = D* = — 
R 


§ Intracellular reaction 

We will now focus on enzyme-catalyzed reactions in the 
interior of a bacterial cell with a constant cell volume Vc. 
Thereby a substrate S is converted into Product P: 


ky 
S+E=— ES 
k-1 


ES “3 P+E 
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The reaction is regulated via a feedback repression of the 
enzyme synthesis. A simplified mechanism is assumed. 
as follows: 


C+D hap enzyme synthesis 
D+P—DP 
—4 
E—°>F enzyme degradation 


Within the cell, ideal mixing is assumed. Reaction orders 
are according to the stoichiometric coefficients. 


a) Set up balance equations for the concentrations of 
reaction partners E, ES, S, P, D, and DP in the form 
& (Vc). The bacterial cell is thereby considered as 
a closed system. 


b) Show that the overall concentration of substance 
Dis constant, i.e. cp +cpp =const= cpo 


c) Determine a, 8 and yin: 


BN ae CE 

dt 

dcp _ B k 

dt 1+y:-cp oe 


using the result of part b) and applying the follow- 
ing assumptions: 


1. The concentration of the enzyme substrate 
complex czs is considered to be constant, i.e. 
—crs = 0. 

di ** 

2. Reaction 4 is considered to be in equilibrium 

(quasi-steady state). 


; ‘ i 
3. Concentration cc is constant (cc = c@). 


Remark: The results of points a) to c) are not 
needed for us to solve the following parts. For the 
remaining parts of this exercise, it is assumed that 
a constant substrate concentration cj is achieved 
within the cell via regulation of the substrate trans- 
port: 


d) Which concentrations c7, and cp do you get in steady 
state, if Product P is further converted with rate 
rp(t) =rp? This conversion results in the following 


equations: 
dcg B 
= 5°'CE I 
dt 1+Y-cp 
dcp 
—— =Q-Cs5:CE—rp Il 


dt 
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e) Derive a single 2nd-order differential equation for 
the product concentration cp from Equation I and 
II while applying the assumption cs = c§ and con- 
sidering a time variable rate rp(t). 


repression of enzyme synthesis f) Linearize this differential equation around the steady 


state (cp,rp) with Acp = cp — cp and Arp = rp — rp. 


® Mixed population 

In the following we will focus on a mixed population in 
a CSTR in continuous cultivation. The reaction volume 
V is kept constant. The mixed population consists of 
two species of micro-organisms: prey (P) and predator 
(R). Prey grows with a specific growth rate and de- 
creases proportionally with the number of predator and 
the number of prey. (The more prey is available, the 
more will be depredated). Ideal mixing and a constant 
growth rate (with y 4 dilution rate D = +) is assumed. 


a) Describe the occurring processes in the population. 


b) Assign the individual terms within the following 
equations to the occurring population processes: 


ne = Ucp — kicpcr —Dcp 


—CR= kocrcp — Der 
dt 


c) Determine the steady state(s) and characterize it 
(them) qualitatively. 


d) Linearize the system around the steady state(s). 


e) Calculate the Eigenvalues for the cases: 


1.D=2u 
1 

2.D=-~ 
5H 


f) What can generally be stated on the stability of the 
steady states? Focus therefore on 3 cases (fl > D, 
}t = Dand u < D) for every steady state. 


@ Deriving rate parameters from biological data 


A cartoon model for gene regulation via an extracellular 
signaling molecule (also called a ligand) is given as fol- 
lows: 

L is the signaling molecule, R the membrane receptor, 
A an adaptor protein, and TF the transcription factor 
involved in the regulation. 


From the cartoon model, one derives the biochemical 
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e The parameter k3 has been measured directly as 
k3 = 0.01555. 


The following assumptions, done by the modeler, are 
needed to determine all parameter values uniquely: k_, = 
0.012, k-2 = 1-10-32, and ky = 1-10-94. 


=” 


Nucleus 


reaction network as follows: 


Lees R, I 
k-1 
R+AER, Il 
k_2 
R, + TF 24 TF Wl 
TF == TEn Iv 
k_4 
TFn > TFn +A Vv 
AS VI 


The following information is known from biological mea- 
surements: 


e Experiments are done with 2 mL of medium and 
6- 10° cells, using a ligand concentration of 1 nM. 


The volume of a cell is 1- 10~!2L, with the nucleus 
taking !/s of the total volume. 


The concentration of the transcription factor in 
whole cell extracts (TF + TFn) was measured as 
0.1M. 


The concentration of the transcription factor with- 
out stimulus in nuclear extracts was measured as 
5nM. 


The concentration of the adaptor protein without 
stimulus in whole cell extracts is 1uM. 


The adaptor protein has a half-life time of 1 h. 


A cell has a total of 10000 membrane receptors. 


The associated constant K, for the ligand-receptor 
binding is 0.01 aa Hint: 


e The associated constant for the receptor-adaptor 
protein binding is 0.01 aM 


oe 
wa 


= 


a) Convert the unit molar M to molecules per cell for 
ligand and transcription factor states TF and TFn. 


Calculate the degradation rate constant (k¢) from 
the half-life time 7; /2. 


Construct an ODE model for the sub-network in- 
volving the reactions IV - VI, i.e., the unstimulated 
case. Use, therefore, the law of mass action. 


lo} 
Ww 


Use the biological measurements in the unstimu- 
lated case (steady-state condition) and the assump- 
tions to determine all parameters in reactions IV - 
VI. Use units such that state variables are given as 
numbers of molecules per cell. 


oO 
Ww 


Determine the parameters for reactions I - III from 
the biological measurements and the assumptions, 
using the same units as before. 


Notes 
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in pi 
qf 


in in 
q CR 


out 
q »Ca, CB 


4. Solutions 


Do not betray yourself! 


Exercises 

® Balance equations 

Concentrations in the extraction pipe are assumed to 
correspond to the concentrations within the tank. Thus, 
we have omitted the specific indication with the indices 
out. 


Reactor 1 


dV _ in out 
dt =4q q 
aL 
dt 


Reactor 2 


dV 
dt 
dna 
dt 
dng 
dt 


in in out 


=q"'cx — GCA — ky caV 


= —q" cB +kicaV 


Reactor 3 


_ Lin _in 


=" ch — qe ca —kicaV +k_icpV 


qe" cB tkycaV —k_cpV 


out 
q »Ca, Cp, Cc 


Reactor 4 


dv 
dt 
dna 


out 
q » Ca, CB, Co 


in out 


dt 
ae 
dt 


Reactor 5 


ge ca — 2k AV + 2k_1cBV 


gitcit — cat kiciV —k_cpV 


got _ gr" ca — 2k civ 
—~¢"’cp + kiciV —kocpV 


— gcc + 3kocpV 


ier ge" ca 2kiciV + 2k_1cpV +kocgV 


— k_acZcaV 


qo" cB kicV k_icgV —kocgpV + k_aczcaV 


gcc 2kocpV 2k_oc%c4V 
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in yin in 
qe", T™", Ca 


l, 


qt, Tt, Cay CR 


= CSTR 
Task a) Amount of substance balances: 


dna 
dt 
d 
= =0- gece +kicaV 
Taskb) Applying the simplifying assumptions g = q'” = 
go" und V = const.: 
dn A d 
dt dt 


Applying the product rule and assume that a CSTR does 
not change the volume in the tank during the process 
(derivative of constant V is zero): 


in in 


= ch —ge'ca —kicaV 


(ca-V) =Vea +VEq =0+VEq 


wV- eg = gc — qca —kycaV 


CA = 4 (cir —ca)— ky “CA (4.1) 


V 


CR = 4 -cp tkica (4.2) 


Task c) From the equation (4.1) with c4 = 0: 


qin a x 
(cy — kjc, =0 
y (CA A) 1C4 
yeh Fea) kic, =0 
a (ct 
=> * VVVA 
“A k+¢ 
x q in 
Cc, = 
a o4hv 4 


and for Substance B based on the equation (4.2) with 
cp = 0: 


‘ qx x} 
= tie £0 
CB V CB 104 


q 
7 Cp =kica 
kiV 
CR= oA 
Cc. = kV : q cin 
Bq qtkv 4 
ct _ kV 
B (q+kV) 


@ Balancing a CSTR 

Cell-free medium (substrate S, influx concentration c’”, 
volume flow rate g) is added to a Continuous Stirred 
Tank Reactor (CSTR) in continuous cultivation mode. 
The reaction volume within the reactor (Vp) is kept con- 
stant. Micro-organisms are growing on the substrate 
within the CSTR. The specific growth rate 1 of the micro- 
organisms is approximated with a Michaelis-Menten 
kinetics. The yield coefficient shall be Yxs5. 


Task a) Set up mass balance equations for the biomass 
(X) and the substrate (5) concentrations. 


[e] 


aap Weel wee 
cx VR oo cs 
Cl V; =—_ mM ae V; 
a qcx + LcxVR qcx + UL a+ R 
d(csVp) e 1 
= Cy) — — V 
dt q (cg s) hee R 
With Vp = const. we get 
dcx q 
—* = —2)= —D I 
He Ox G7) = ex(H— D) 
dcs J px 1 Zz 1 
= Zu — D Zu a ee Il 
A ys 8) YH (cf — cs) Yoh 
Task b) Derive the steady-state concentration values 
(c§,cx) as a function of the dilution rate D = D* = 2 
R 
d 
Eq. I: = =cx(u —D) =0 
Case 1: > 
+ in II: D*(ch— ck) =0 = |ch = "| 
CS 
Case 2: = [bag = D* 
we = Unax = ce+Ks 
Cy Lax = D* cs + KsD* 
KsD* 
=> |c§ = _ Il 
Umax za D* 
KsD* 
— in II: D* (a i 5) 
Umax ~~ D 
i KsD 
Umax —D* x 
Umax e Cy = 0 
Yyso K. rae = 
KsD* 
yr G ei 3) 
Umax ei 
_ bra KEP «6 
Yys Ee ee © 
4 KsD* 
>lcy = Yxs (@ = a) IV 
“ 5 Umax —D 


Conditions: 
The substrate concentration cannot be negative (Cj > 0), 
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so that Equation III tells us that: 
=> Umax > D* Vv 


If Lnax < D* we get a wash-out of the biomass, which is 
usually not desired in a CSTR. 
Biomass should also be positive (cy > 0): 


KsD* 


————_ >0 
Umax — D* 


=>c" 


A) 
with max > D*: 


CF Umax — CD — KsD* > 0 
cS + Ks 


lone 


S 


=> Umax > D* VI 


whereby Equation V is contained in Equation VI, be- 
cause: 


(4.3) 


for practical considerations one uses [nay > D. 
In summary: 


CH +K. 
Case 1: Umax > = 
Cs 
= Two steady states 
Ci + K. 
Case?) Pine, = 5 pt 
Cj 
KsD* 
> C= 
&s = D* — D* 
S 
2 KsD*C3" _ 
(c+ Ks)D* —Dtc# 
=> cx =0 
= Both steady states are equal. 
CH +K. 
Case 3: Umax < aR 
Cs 


= It exists only the first steady state 


ZU 


cy = 0;c5* = cy 


Bf Intracellular reaction 


We will now focus on enzyme-catalyzed reactions in the 
interior of a bacterial cell with a constant cell volume Vo. 
Thereby a substrate S is converted into Product P: 


k 
S+E—= ES 
k_] 


ES -25P+E 
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The reaction is regulated via a feedback repression of the 
enzyme synthesis. A simplified mechanism is assumed 
as follows: 


C4D—4.E+D enzyme synthesis 
D+P — DP repression of enzyme synthesis 
—4 
E-°>F enzyme degradation 


Within the cell, ideal mixing is assumed. Reaction orders 
are according to the stoichiometric coefficients. 


1st Part 


Task a) Set up balance equations for the concentrations 
of reaction partners E, ES, S, P,D, and DP in the form 
L(V -c). The bacterial cell is thereby considered as a 
closed system. 


d 
hh (Vc . CE) = (—kicecs +k_\cges +koces + k3cpcc — ksce)Vc 


d 
— (Vc: crs) = (kicgcs — k_1¢gs — kocKs)Vc 


dt 
d 
He -Cs) = (—kicecs +k_1Cgs)Vc 
a (Vc . cp) — (kocEs _ kacpCp + k_acpp)Vc 
d 
or (Vc . cp) — (—kacpcp + k_acpp)Vc 
d 
a (Vc - cpp) = (kacpcp — k-4cpp)Vc 
with: 
dn d(cV) dc dVc . 
— = | =V, th Vc = const. 
a ae a ae eng? eee 
Thus we get: 
d 
ae = —kicecs +k_1 ces +kaces + k3cpcc — ksce I 
aes = kicrecs —k_\cgs — koces su 
g kicrces t+k_\c Ill 
—cg=— z= 
dt S 1CECS 1CES 
d 
Pag = kocgs — kacpCp + k_4cpp IV 
é kacpcp +k_ac V 
Cp = + K 
Heo ACDCp A4CDP 
—Cpp = kacpcp = k_acpp VI 
dt 


which works only if the volume remains constant. 


Task b) Show that the overall concentration of Sub- 
stance D is constant, i.e. cp + cpp =const= cpo 

If you look at the balances of cp and cpp, you see that 
the sum is zero: 

d d 
dt dt 


Cpp = —kacpcp +k_acpp +k4cpcp — k_acpcp = 0. 


Chapter 4: Physical modeling and non-linear enzyme kinetics — 29/36 


By integrating this equation, one obtains: 


Cp +cpp = const. 


Task c) Determine a, B, and yin: 


d 
> VII 
dcg B 

2 ks: Vill 
de Wye 


using the result of part b) and applying the following 
assumptions: 


1. The concentration of the enzyme substrate complex 


ces is considered to be constant, i.e. Hees = 0: 


d 
cbs = hice es — k_1ces — haces = 0 
ky 
Crs = ————CEC. Ix 
ES bilo ECS 
in Equation III: 
dS k_jky 
sana 
at 1CECS + a ay 
Lek Seb eaka 
=> CECS 
k_\ +k 
dS kiko 
= CEC 
a ees 
kik 
a 1k2 
k_j+ko 


2. Reaction IV is considered to be in equilibrium (quasi- 


steady state). 

We already have @ and still need y and B. These 2 
parameters can be found in Equation VIII, because 
this equation contains the derivative of cz. We 


might tinker something together that looks similar. 


Let’s use Reaction I: 


dcge 
dt 


= —kycgecs + (k-1 +ko)ces +k3cpcc — ksce 


Now, we have the problem that we have the terms 
ce and cs, and ces (which we do not want). If we 


put Equation IX in we get: 
dcr k oi k 2 
——$ = — CEC —$§_ ——— CEC. 
dt 1CECS kas ECs 
+k3cpcc — ksce 
d 
= =k3cpcc —kscr 


Well, that is some progress. Now, we take care of 
cp,cc and therefore we have another hint in the 


task description. Let’s use the equilibrium equa- 
tion (IV): 


! 
aoe koces — kacpcp +k_acpp = 0 


kacpcp = k_acpp +k2ces 


We got another hint from task b) where we define 
a total concentration of enzyme-regulating protein 
cpo With cp + cpp = cpo = const. Using this gives: 


kacpcp = k_a(cpo — cp) +k2ces 
(kacp + k_4)ep = k_4cpo + koces 
poe k_acpo + koces 
(kacp + k_a) 
cpo + CEs 


(¢Lep +1) 


CpP = fap CpCp 
4 


3. Concentration cc is constant (cc = cé). 


Finally, we obtain the parameters: 


B =ksccepo 


2nd Part 
We assume now that we give as much substrate as the 
cells consume so that: 


Task d) Which concentrations cz and cp do you get 
in steady state, if Product P is further converted with 
rate rp(t) =r? This conversion results in the following 
equations: 


dce B 

— 5 CE I 
dt 1+Y-cp 
ae Il 
— = -cs:cE—-r, 
dt S°CE P 
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In the stationary case: 


OP = -cs-c8 —rp 20 see= oF 
ere ksce=0 
re isch = ks as 
1+y7 p= PPS 
apc, 1 = aBcs—ks-rp 


Yks-rp ks + rp 


Taske) Derive a single 2nd-order differential equation 
for the product concentration cp from Equation I and II 
while applying the assumption cs = cj and considering 
a time variable rate rp(t). Assume cs(t) = c§. 


From the system equations I and II we get: 


: +ksc 
“¢ = 
jen 1+ Yep 


: a + 
= — | —c,+r 
ie Ge. dee 


We use this to obtain: 


1 (& a d " ks [ad B 
Cc r cp+rp } = —— 
acy de de?) ack at 8) 14 yep 


or 
a d acsB 
k —— ksrp|. 
go a as 
Task f) Linearize this differential equation around the 


steady state (cb, rp) with Acp = cp — cp and Arp = rp — rp. 


Assume cs(t) = c§. 


The linearization of f = ae delivers: 
Hic OF 1 Of 
Nee) FeplesMer = Tye * Dep leer 
1 1 
= -y: Ocp..., 
ity (+r O°? 
with which we get: 
a d (ks-rp)°Y 
—A ks —A ksrp A 
ape OP TMS Ger — NTP YBa OCP 
ae ee nee 
~ ht rp 5\’p T Arp), 
and finally: 
a d (ks -r5)?y d 
—A ks —A Acp = A ksArp|. 
gee ge ape gy 
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& Mixed population 

In the following we will focus on a mixed population in 
a CSTR in continuous cultivation. The reaction volume 
V is kept constant. The mixed population consists of 
two species of micro-organisms: prey (P) and predator 
(R). Prey grows with a specific growth rate and de- 
creases proportionally with the number of predator and 
the number of prey. (The more prey is available, the 
more will be depredated). Ideal mixing and a constant 
growth rate yt (with u ¥ dilution rate D = 4) is assumed. 


Task a) Describe the occurring processes in the popu- 
lation. 
Birth, prey-predator interaction, extraction 


Task b) Assign the individual terms within the follow- 
ing equations to the occurring population processes: 


predation extraction 
—kicrcp —Dcp 


birth 


d 
prey concentration: ae = Ucp 


k2cpcR 
growth on prey 


predator concentration: —Dcr 


d 
—cr= 
dt extraction 


Task c) Determine the steady state(s) and characterize 
it (them) qualitatively. 

0=(u—kicr —D)cp I 

0 = (kocz — D)c II 


Steady State 1: > [cer = 0,c% =O] 


D —D 


Steady State 1 basically says that neither prey nor preda- 
tors are present because they are washed out. 

Steady State 2 exists only if ck, > 0 is. Therefore, the 
growth rate has to compensate at least the dilution rate 
Li > D. If uw < D, we have Steady state 1. 


Taskd) Linearize the system around the steady state(s). 


From Equation I: 
Cp = Ucg — kicpcr — Dep 
we get: 


Cp +Aép = f (cp, ca) +(H—D)- Acp — kip Ace — kc Acp 
~weY — 
0 O(ss) 
Acp = (u —kicr—D) -Acp — k\cp- Acr 


with cp = ch +Acg and cr = cp +Acr. 


From Equation II: 


Cr = kocpcr — Der 
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we get: 


Aér = —D- Acr + kop: Acp + kacp: Acr 
Aér = kocp: Acp + (kocp — D)- Acr 


Steady State 1 (0,0): 


Acp = (u —D) - Acp 
Acr => —D-Acr 


(a) ("0° “o) (act) 


Steady State 2 (2, uP) 


k k 
Aép = (us — 4+ D—D)-Acp — —DAcp = ——DAcp 
ko ko 


k k 
Aér = - (uu — D)Acp + (D—D) + Acr = 7, (HD) Ace 
1 1 


Aép\_ ( HO  —2D\ (Acp 
Acer} \2(u-D) 0 Acr 


Taske) Calculate the Eigenvalues for the cases: 
1.D=2u 
2. D= su 
Steady State 1 (0,0): 
|sf —A| =0 
b—(u-D)\s+0)=0 + [=E=DH==D] 
1.) D=2u 
Ss; =—-U 
s2 = —2p 
=> stable 
il 
sy = 5H 
ol 
s2 = ~ 5H 


=> unstable saddle 


Steady State 2 (2, uP) 


1.) D = 2 Because the growth rate is just the half of the 
dilution rate we will have a wash-out. See task c). 


2.) D=~ 
-)D=5H 


ae ta 
1 

s7.S 

2 5H 


=> unstable saddle 


Steady State 2 (2, a 


1 1 al 
>S52=2 5h sh =i 5H 


=> metastable circle (limit circle). 


Task f) What can generally be stated on the stability 
of the steady states? Focus therefore on 3 cases (ll > D, 
ft = Dand u < D) for every steady state. 


Steady Statel | u>D | p=D u<D 
unstable xX 
stable xX 
metastable Xx 
Steady State2 | u>D | w=D u<D 
unstable Xx not exists 
stable not exists 
metastable x not exists 


@ Deriving rate parameters from biological data 

A cartoon model for gene regulation via an extracellular 
signaling molecule (also called a ligand) is given as fol- 
lows: 


=” 


Nucleus 


Cytosol 


L is the signaling molecule, R the membrane receptor, 
A an adaptor protein, and TF the transcription factor 
involved in the regulation. 


From the cartoon model, one derives the biochemical 
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reaction network: 


eR ee I 
k-1 
ee ee Il 
k_2 
R, + TF 25 TFn an 
TF == TEn Iv 
k_4 
TFn 55 TFn+A V 
AS VI 


The following information is known from biological mea- 
surements: 


e Experiments are done with 2 mL of medium and 
6- 10° cells, using a ligand concentration of 1 nM. 


The volume of a cell is 1-10~!2L, with the nucleus 
taking 1/s of the total volume. 


The concentration of the transcription factor in 
whole cell extracts (TF + TFn) was measured as 
0.1M. 


The concentration of the transcription factor with- 
out stimulus in nuclear extracts was measured as 
5nM. 


The concentration of the adaptor protein without 
stimulus in whole cell extracts is 1uM. 


The adaptor protein has a half-life time of 1h. 


A cell has a total of 10000 membrane receptors. 


The associated constant K, for the ligand-receptor 
binding is 0.01 5,. Hint: 


ky (L]IR] = k1 [Ri 
ky [Ri] 


* ka IR 


The associated constant for the receptor-adaptor 
protein binding is 0.01 mae 


e The parameter k3 has been measured directly as: 
k3 = 0.01545 


The following assumptions, done by the modeler, are 
needed to determine all parameter values uniquely: k_, = 
0.012, k-2 =1-10772, ky =1-10->2 
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Task a) Convert the unit molar M to molecules per cell 
for ligand and transcription factor TF and TFn. 
Ligand: 


f=iMSsie ee Se iosemoweuss 
ieeul 
Mig VS Gxt e110 


= 1.2-10!* molecules in 2 mL 


1.2-10!molecules 6 molecules 
=> — 2 . 1 —_——— 
6- 10°cells ° cell 


Transcription factor: 


ee ee re eee 
L L 
1 
= 5-10-95" .Na-V 
9 mol 3 Molecules 02-10-21 


= 5-10? — -6.022- 107 
5-10-* +— -6.022- 10 


_1602 molecules 
cell 
Taskb) Calculate the degradation rate constant ke from 
the half-life time 7; /: 


A=—kg-A 
=> A(t) =Ao-e *" 
Ti2: A(t) =4o/2 
> a = Ao e kelp 


Task c) Construct an ODE model for the sub-network 
involving the reactions IV - VI, i.e. the unstimulated case. 
Use, therefore, the law of mass action. 


CL = —k, CLCR +1 K_1Cr; I 
CR = —k; CLCR +1 K_1Cri I 
Cri = +k CLCR — K_\ Cri — k2CriCa + k—2CRa il 
CRa = +kocrica — k—2CRa — k3CRaCTF IV 
Ca = —kocrica + k_-2CRa + kscren — koca Vv 
Crp = —k3cracrr — kacrr + k—acrrn VI 


CT Fn = +k3cracrr + kacrr — kK-acrrn Vil 
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Task d) Use the biological measurements in the unstim- 
ulated case (steady-state condition) and the assumptions 
to determine all parameters in reactions IV - VI. Use 
units such that state variables are given as numbers of 


given: T)/.(A) = lh = 3600s 


molecules per cell. > k= a) = WA) 1.9. 10-42 
° Tip(A) 36005 § 
I=Il: —k, -0-cp +k_icp; =0 
=> given: ky = 10~>+4 
TM: ky-O-ch—k_1-0—ky-0+k_»-c%, =0 
=sIlc% =0 V2:> k=k 
° 6 CTFn 
IV: ky -0-ca —k_2:0—k3-0-cpp =0 1 2 1 
Pe ete Fee Pee af Seip 200 
V.2: Ca = —kocrica +k_2CRa + ksctpn — keca = 0 s 602 KY 
VI.2: 9 Crp = —k3cractr — kactp + k_4ctRh = 0 
Vs See = ede ee oa Task e) Determine the parameters for reactions I - III 
from the biological measurements and the assumptions, 
Volume: using the same units as before. 
given: Veell = Va iis © 
= Vive = 107 PL = 0.2- 10-2 1 1 
nuel = 5 =U. given: ka(L—R) =0.015,, k-1 = 0.01; 
4 
= Veto = 5? 10°-°L=0.8-10-7L ;: : eee 
I : * 6 : 
=> Kz = — =0.01— Lig.:1nM=2.- 10 
ae a nM ( reo cell ) 
Concentrations: K,= Ay =—5. -9__ cell 
a 
k_y molecules 
- ecaiac cell 
given: crr +crFn = 0.1UM Sol) molecules 
1 cell 
z —¢ mol = 0.01—-5-10-° ———_ 
=>, =0.1-10 L Na+ Veet s molecules 
CTFTCTFn —5.49-1 cell 
_¢mol molecules _ _ a lecul 
= 0.1-10~® —— - 6.022 - 108 ——_— . 10° @L Borer Ne 
L cell 
— 60220 molecules given: K,(R;—A) = 0.1m considerd only in the cyto- 
cell 


given: crf, = 5nM 


plasma Veyto. K-2 = 10734. 


_ogmol ko L 1 
=>=5-10-°— -N,-V, K,= =0.1 
cer me a 10-Smol Na+ Veyto 
= 5-10-92, 6.032. 10 ee g.210-AL ee: l mol 
L 6.022-102-0.8-10-!2 mol molecules L 
molecules 
= ll 
== =OG 
ee molecules 
molecules 
= = ll 1 
=> CTF == 00220 O02 Ole aa =>k = K,-k_2 =?) 1: 1077 ce . 3 
molecules KY 
given: ca = 1uM =~ 91-19-10 cell 
1 lecul s- molecules 
Sy Sits 19-6 Nievae 602200 
ce 


: : = —51 
given: ky = 107°; 


CTF 


VI2:> k4g=kg- 
CTFn 


1 
nM-s 


given: k3 = 0.01 considered only in the cytoplasma. 


L mol 


cell 


=>k3;=0.01 


agit 
; molecules -s 


10-9mol-s 6.022-102molecules 0.8-10-!2L 
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Parameter Value Unit 
Ka 2.1- 10-7 cell/molecules 
fal 5-10 HM cell/s.molec 
ky 0.01 1/s 
9) 2.1-10 a0 cell/s.molec 
k_o 10-3 I/s 
k3 2.1- 10> cell/s.molec 
kg 10-5 I/s 
k_4 165° I/s 
ks 0.19 I/s 
ke 1.9-10~4 I/s 


Notes 


Notes 


This book need not end here... 
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